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TRANSGENIC PLANTS CONTAINING ALTERED LEVELS 

OF STEROID COMPOUNDS 



5 TECHNICAL FIELD 

The present invention relates to biotechnology 
with an emphasis on plant biotechnology, and 
particularly biotechnology affecting the biosynthesis 
of steroid compounds . 

10 

BACKGROUND 

Enhancement of the nutritional or health benefits 
of oils through genetic engineering is being addressed 
throughout the agricultural community. Several 

15 approaches involve manipulation of already present 

cellular biosynthetic pathways. Steroid biosynthetic 
pathways are of current interest, particularly for the 
enhancement of health benefits from food oils. 

Several related U.S. patents address increasing 

20 sterol accumulation in higher plants. Those patents 
include U.S. Patent No. 5,589,619 "Process and 
Composition for increasing squalene and sterol 
accumulation in higher plants" (accumulation of 
squalene in transgenic plants by increasing HMGR 

25 activity) and U.S. Patent No. 5,306,862 "Method and 
composition for increasing sterol accumulation in 
higher plants" (increasing HMGR activity to increase 
plant sterol accumulation- -including sterol and 
cycloartenol , which affects insect resistance- - in 

30 tobacco, tomato, corn, carrot, soybean, cotton, barley, 
arabidopsis, guayule and petunia; seeds with elevated 
sterol /cycloartenol , 7S promoter and CaMV promoters), 
U.S. Patent No. 5,365,017 "Method and composition for 
increasing sterol accumulation in higher plants" (DNA 
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construct with HMGR-CaMV 3 5S, transgenic plants, hybrid 
plants, corn, soy, barley, tomato, Arabidopsis) , U.S. 
Patent No. 5,349,126 "Process and composition for 
increasing squalene and sterol accumulation in higher 
plants" (increase in squalene and sterol accumulation 
by increasing HMGR activity in transgenic tobacco, 
cotton, soybean, tomato, alfalfa, Arabidopsis, corn, 
barley, carrot and guayule plants) , and EP 486290 
(enhancement of squalene and specific sterol . [squalene 
zymosterol, cholest-7, 24-dienol, cholest-5, 7,24- 
trienol] accumulation in yeast by increasing HMGR 
activity in yeast deficient in enzymes that convert 
squalene to ergosterol) . 

In those patents, the amount of a protein 
exhibiting 3-hydroxy-3-methylglutaryl Coenzyme-A 
reductase (HMGR) activity is typically increased. HMGR 
widens a "bottleneck" near the beginning of a 
biosynthetic path to steroid production, permitting a 
higher carbon flux through steroid biosynthetic 
pathways and resulting in increased sterol 
accumulation . 

U.S. Patent No. 5,480,805 "Composition for 
modulating sterols in yeast" (enhancement of delta 8-7 
isomerase activity-ERG2 enhances accumulation of 
specific sterols in yeast) . 

U.S. 5,460,949 "Method and composition for 
increasing the accumulation of squalene and specific 
sterols in yeast" (increasing squalene, zymosterol and 
specific sterols in yeast by increasing HMGR in yeast 
having decreased ergS and erg6 activity- -Sc and hamster 
HMGR) . 

WO 9845457 (SMTI, Erg6 from A. t., corn, yeast; 
transgenic plants with altered sterol levels_using DNA 
encoding an enzyme binding a first sterol and producing 
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a second sterol - -altered carotenoid, tocopherol, 
modified FA levels- -HMGR, 5a-reductase , geranylgeranyl 
pyrophosphate synthase, phytoene synthase, phytoene 
desaturase, isopentenyl diphosphate isomerase) . 

Acetate is the metabolic precursor of a vast array 
of compounds vital for cell and organism viability. 
Acetyl coenzyme A (CoA) reacts with acetoacetyl CoA to 
form 3 -hydroxy- 3 methylglutaryl CoA (HMG-CoA) . HMG-CoA 
is reduced to mevalonate in an irreversible reaction 
catalyzed by the enzyme HMG-CoA reductase. Mevalonate 
is phosphorylated and decarboxylated to isopentenyl - 
pyrophosphate (IPP) . Through the sequential steps of 
isomerization, condensation and dehydrogenation, IPP is 
converted to geranyl pyrophosphate (GPP) . GPP combines 
with IPP to form farnesyl pyrophosphate (FPP) , two 
molecules of which are reductively condensed to form 
squalene, a 30-carbon precursor of sterols. 

A key enzyme in sterol biosynthesis is 3-hydroxy- 
3 -methylglutaryl -Coenzyme A reductase (HMG-CoA 
reductase or HMGR) . Schaller et al . (Plant Physiol. 
109: 761-770, 1995) found that over-expression of 
rubber HMGR {hmgl) genomic DNA in tobacco leads to the 
overproduction of sterol end-products (sitosterol, 
campesterol and stigmasterol ) up to 6- fold in leaves. 
Further, the excess sterol was stored as steryl -esters 
in lipid bodies. HMGR activity was increased by 4- to 
8-fold. 

Sterols are derivatives of a fused, reduced ring 
system, cyclopenta- [a] -phenanthrene , comprising three 
fused cyclohexane rings (A, B, and C) in a phenanthrene 
arrangement, and a terminal cyclopentane ring (D) 
having the formula (I) and carbon atom position 
numbering shown below: 
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where R is an 8 to 10 carbon-atom side chain. 

In plants, squalene is converted to squalene 
epoxide, which is then cyclized to form cycloartenol 
(4,4, 14a-trimethyl-9p, 19-cyclo-5a-cholest-24-en-3p-ol) . 
Cycloartenol has two methyl groups at position 4, a 
methyl group at position 14, a methylene bridge between 
the carbon atoms at positions 9 and 19 that forms a 
disubstituted cyclopropyl group at those positions, and 
includes an 8 -carbon sidechain of the formula: 
CH3CH (CH2 ) 2CH=C {CH3 ) 2 - Squalene epoxide can 
alternatively be converted into pentacyclic sterols, 
containing five instead of four rings. Exemplary 
pentacyclic sterols include the phytoalexins and 
saponins . 

Being one of the first sterols in the higher plant 
biosynthetic pathway, cycloartenol serves as a 
precursor for the production of numerous other sterols. 
In normal plants, cycloartenol is converted to 

predominantly 2 4 -methylene cycloartenol (4,4,14a- 

dimethyl-9p, 19-cyclo-22 , 23 -dihydro-ergosta-24 (28) -en-3- 

P-ol) , cycloeucalenol , (4 , 14a-trimethyl-9p, 19 cyclo-5a- 

ergosta-24 (28) -en-3P-ol) , isof ucosterol (5a-stigmasta- 

5-24 (28) -dien-3p-ol) , sitosterol (5a-stigmasta-5-en-3p- 

ol) , stigmasterol- (stigmasta-5 , -22 -dien-3p-ol ) , 
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campesterol (5a-ergosta-5-en-3p-ol) , and cholesterol 
(5a-cholesta-5-en-3p-ol) . These transformations are 
illustrated in Fig. 1. 

Although sterols produced by plants, and 
particularly higher (vascular) plants, can be grouped 
by the presence or absence of one or more of several 
functionalities, plant sterols are classified into two 
general groups herein; i.e., those containing a double 
bond between the carbon atoms at positions 5 and 6 
(delta- 5 or A5 sterols) and those not containing a 
double bond between the carbon atoms at positions 5 and 
6 (non-delta-5 sterols) . 

Exemplary naturally-occurring delta-5 plant 
sterols are isof ucosterol , sitosterol, stigmasterol , 
campesterol, cholesterol, and dihydrobrassicasterol . 
Exemplary naturally occurring non-delta-5 plant sterols 
are cycloartenol , 24 -methylene cycloartenol , 
cycloeucalenol , and obtusif oliol . The most abundant 
sterols of vascular plants are campesterol, sitosterol, 
and stigmasterol, all of which contain a double bond 
between the carbon atoms at positions 5 and 6 are 
classified as delta-5 sterols. 

The HMG-CoA reductase enzymes of animals and 
yeasts are integral membrane glycoproteins of the 
endoplasmic reticulum. The intact enzyme comprises 
three regions: a catalytic region containing the active 
site of the enzyme; a membrane binding region anchoring 
the enzyme to the endoplasmic reticulum; and a linker 
region joining the catalytic and membrane binding 
regions of the enzymes. The membrane binding region 
occupies the amino- terminal (N-terminal) portion of the 
intact protein, whereas the catalytic region occupies 
the carboxy- terminal (C- terminal) portion of the 
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protein, with the linker region constituting the 
remaining portion. M.E. Basson et al . , Mol . Cell 
Biol. , 8(9) :3797 -3808 (1988) . 

The activity of HMG-CoA reductase in animals and 
yeasts is known to be subject to feedback inhibition by 
sterols. Such feedback inhibition requires the 
presence of the membrane binding region of the enzyme . 
See, e.g., G, Gil et al. Cell, 41:249-258 (1985); M. 
Bard and J.F. Downing, J". Gen. Microbiol., 124:415-420 
(1981) . 

Given that mevalonate is the precursor for sterols 
and other isoprenoids, it might be expected that 
increases in the amount or activity of HMG-CoA 
reductase would lead to increases in the accumulation 
of both sterols and other isoprenoids. 

In mutant strains of the yeast Saccharornyces 
cerevisiae (S. cerevisiae) having abnormally high 
levels of HMG-CoA reductase activity, the production of 
two sterols, 4 , 14 -dimethyl zymosterol and 14- 
methylf ucosterol is markedly increased above normal. 
Downing, et al . , Biochem. Biophys. Res. Co/nm. , 94(3): 
874-979 (1980) . 

When HMG-CoA reductase activity was increased by 
illumination in non-photosynthetic microorganisms, 
isoprenoid (carotenoid) , but not sterol (ergosterol) , 
synthesis was enhanced. Tada, et al . , Plant and' Cell 
Physiology, 23 (4) : 615-621 (1982). 

WO 9703202 discloses a method for identifying 
agents modulating sterol biosynthesis using a yeast 
acetoacetyl CoA thiolase (ERGIO) gene linked to a 
reporter system to evaluate compounds, such as 
lovastatin and other HMG-CoA synthase inhibitors, that 
affect cholesterol biosynthesis. 
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U.S. Patent No. 5,668,001 teaches a recombinant 
avian HMG-CoA synthase preparation useful for 
evaluating drugs that inhibit cholesterol biosythesis. 

JP 09121863 discloses a plant with increased 3- 
hydroxy- 3 -methyl glutaryl coenzyme A reductase (HMGR) 
activity as a result of increasing the expression of a 
mutant protein kinase gene that regulates expression of 
the HMGR gene. The increased HMGR activity increases 
squalene, zymosterol, cholesta-7 , 24 -dienol and cholest- 
5, 7 , 24-trenol accumulation in yeast with ERGS and ERG6 
mutants . 

EP 480730 " Plant-sterol accumulation and pest 
resistance-by increasing copy number of 3-hydroxy-3- 
methyl glutaryl coenzyme -A reductase gene in tobacco, 
tomato and corn 

WO 9913086 "Human Delta 7-sterol reductase 
polypept ide-usef ul for diagnosis or treatment of 
genetic defects e.g. hereditary Smith-Lemli -Opit z 
syndrome" teaches making and using the recombinant 
polypeptide with humans. 

Chappell et al . U.S. Patent No. 5,589,619 teaches 
that transformation of higher plants with truncated 
HMG-CoA reductase enhanced the production of squalene, 
cycloartenol and certain sterols, particularly 
compounds having unsaturations at the 5 -position. 
Several intermediate sterols as are shown in Fig. 1 
were also produced. It would be beneficial if the 
production of sitosterol and stigmasterol could be 
enhanced while lessening the accumulation of the 
intermediate sterols. The present invention provides 
avenues for enhancing production of sitosterol and 
stigmasterol and lessening the accumulation of the 
intermediate sterols . 
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Gonzalez et al . (Abstract of poster at Third 
Terpnet Meeting of the European Network on Plant 
Isoprenoids, May 29-30, 1997, Poitiers, France) over- 
expressed the Arahidopsis HMGR cDNA {hmgl and hmg2) and 
found sterol overproduction with hmgl only. They used 
two forms of the hmgl gene, a full-length form and a 
truncated form containing only the catalytic domain. 
HMGRs have three domains, an N- terminal membrane 
spanning domain, a short linker domain, and a C- 
terminal catalytic domain. In this case the transgenic 
plants were also Arahidopsis . The difference between 
the full-length and truncated forms was a greater 
accumulation of pathway intermediates in the case of 
the truncated form. More importantly, the intermediates 
demonstrated as accumulating were cycloartenol , 24- 
methylenecycloartanol and obtusif oliol . 

Finally, US patents 5,365,017 and 5,306,862, both 
assigned to Amoco Corp., disclose a method for 
increasing sterol accumulation in plants by increasing 
the copy number of a gene having HMG-CoA reductase 
activity. These inventions disclose a method using 
hamster truncated HMGR that consisted of the catalytic 
domain and the linker domain. According to the claims 
the linker domain was essential for activity. They also 
demonstrated a greater accumulation of pathway 
intermediates such as cycloartenol . 

BRIEF SUMMARY 

The present invention relates to transgenic plants 
and their progeny having improved nutritional 
characteristics. More particularly, the present 
invention relates to transgenic plants and their 
progeny, the storage organs (e.g. seed, fruit and 
vegetable parts) of which contain modified levels of 
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Steroid compounds, such as (i) elevated levels of 
beneficial phytosterols (e.g., sitosterol), 
phytostanols (e.g., sitostanol) , and esters thereof, 
relative to an otherwise identical plant transformed 
5 only with a truncated HMG-CoA reductase gene or a wild- 
type plant, and (ii) reduced levels of steroid pathway 
intermediate compounds (e.g. one or more of squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusif oliol , 
stigmasta-7-enol and campesterol) in their storage 

10 organs relative to an otherwise identical transgenic 
plant transformed only with a truncated HMG-CoA 
reductase gene. Nucleic acid sequences encoding 
enzymes that affect the biosynthesis and accumulation 
of steroid compounds in plants (HMG-CoA reductase and a 

15 steroid pathway enzyme) , and methods for using these 
sequences to produce such transgenic plants, are also 
provided. These methods comprise, for example, 
introducing into cells nucleic acid sequences encoding 
enzymes that affect the levels of accumulated steroid 

20 pathway end products. 

The present invention contemplates a recombinant 
construct or a recombinant vector that contains 2 DNA 
sequences. The first encodes a polypeptide exhibiting 
3 -hydroxy-3-methylglutaryl -Coenzyme A (HMG-CoA) 

25 reductase activity. The second DNA sequence encodes a 
polypeptide exhibiting the activity of another steroid 
pathway enzyme. Each polypeptide -encoding DNA sequence 
is operably linked in the 5' to 3 * direction to a 
promoter and a transcription termination signal 

30 sequence independent of the other sequence . The 
promoter is located upstream and the termination 
sequence downstream of each polypeptide-encoding DNA 
sequence. The second DNA sequence encoding a steroid 
pathway enzyme can code for a squalene epoxidase 
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enzyme, a sterol methyl transferase I enzyme, a sterol 
C4-demethylase enzyme, a obtusifoliol C14a-demethylase 
enzyme, a sterol C5-desaturase enzyme, or a sterol 
methyl transferase II enzyme. It is contemplated that 
HMG-CoA reductase and the steroid pathway enzyme 
activity comes from a mutant or truncated form of those 
enzymes, such as a truncated HMG-CoA reductase lacking 
the transmembrane region while retaining a functional 
catalytic domain. Examples of such preferred HMG CoA 
reductases include the truncated rubber and Arabidopsis 
HMG CoA reductases disclosed herein. 

Preferably, the regulatory function of a promoter 
is substantially unaffected by cellular levels of 
squalene such as the CaMV 3 5S promoter. In one aspect, 
a promoter is seed-specific. In another aspect, a 
promoter is derived from a species in a different order 
from a host cell. In another aspect, the HMG-CoA 
reductase or steroid pathway enzymes is from a species 
in a different order from the order that of the host 
cell. The invention contemplates a construct or 
* recombinant vector having more than one DNA sequence 
encoding a steroid pathway enzyme that do not have to 
be under the control of the same promoter. Preferably, 
a recombinant vector is a plant expression vector. 

In another aspect of the invention, a transformed 
host cell comprises a recombinant construct or vector 
as described above. Preferably, a host cell is plant 
cell, preferably that plant cell is from canola, 
soybean, corn, maize, tobacco, cotton, rape, tomato or 
alfalfa. The invention contemplates a host cell in a 
cell culture, plants derived from transformed host 
cells, and storage organs (seeds, fruits and vegetable 
parts) from transgenic plants. 
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In addition to contemplating transgenic plants and 
seeds, the invention contemplates transgenic plant 
seeds capable of germinating into a transgenic plant 
and mutants, recombinants, genetically engineered 
derivatives thereof and hybrids derived therefrom. The 
plant over-accumulates steroid pathway products 
relative to a native, non- transgenic plant of the same 
strain, wherein said mutants, recombinants, genetically 
engineered derivatives thereof and hybrids derived 
therefrom maintain the ability to overaccumulate 
steroid pathway products. 

The invention contemplates a process of increasing 
the formation of steroid pathway products in a 
transformed host cell as compared to an otherwise 
identical non- transformed host cell. Contemplated 
processes use the described recombinant constructs and 
vectors to transform host cells, then growing the host 
cells or regenerating transgenic plants therefrom. 

In one aspect of the invention, the genome of a 
contemplated plant, its progeny, seeds or cell culture, 
comprises introduced DNA encoding an HMG-CoA reductase 
activity and introduced DNA encoding a steroid pathway 
enzyme that is a squalene epoxidase enzyme, a sterol 
methyl transferase I enzyme, a sterol C4 -demethylase 
enzyme, a obtusifoliol C14a-demethylase enzyme, a 
sterol C5-desaturase enzyme, or a sterol methyl 
transferase II enzyme. The storage organs of such a 
plant contain an elevated level of total accumulated 
sterol, compared to storage organs of an otherwise 
identical plant, the genome of which does not comprise 
said introduced DNA. Further, the storage organs of 
the plant contain a reduced level of squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusifoliol , 
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stigmasta-7-enol , or campesterol compared to the seeds 
of an otherwise identical plant or a plant comprising 
an introduced DNA encoding an HMG-CoA reductase enzyme. 

The invention contemplates a method of producing a 
plant that accumulates an elevated level of sterol 
pathway products compared to a corresponding plant 
comprising no introduced DNA encoding a peptide, 
polypeptide, or protein that affects the biosynthesis 
and accumulation of a sterol pathway product, 
comprising sexually crossing plants to arrive at a 
plant comprising nucleic acid encoding an HMG CoA 
reductase and a steroid pathway enzyme, including 
crosses with a nurse cultivar. The plants, including 
apomicitic plants, uniform populations of the plants 
and their seeds and parts other than seeds are 
contemplated. 

Another aspect of the invention is oils containing 
at least one sterol pathway product, extracted from the 
seeds of a contemplated plant. Preferably sitosterol, 
at least one sitosterol ester, or mixtures thereof, 
comprise at least about 57% by weight of the total 
sterol compounds of a contemplated oil. Preferably 
sitosterol, that at least one sitosterol ester, or 
mixtures thereof, comprise at least about. 0.08% of the 
dry weight of a contemplated seed. Preferably, the oil 
has a reduced amount of squalene, cycloartenol , 24- 
methylene cycloartenol , obtusif oliol , stigmasta-7-enol , 
campesterol, or combinations thereof, compared to oil 
from a corresponding transgenic plant that does not 
contain introduced DNA encoding a squalene epoxidase 
enzyme, a sterol methyl transferase I enzyme, a sterol 
C4-demethylase enzyme, a obtusif oliol C14a-demethylase 
enzyme, a sterol C5-desaturase enzyme, a sterol methyl 
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transferase II enzyme, or mixture thereof; wherein the 
reduction is in the range of from about 10% to about 
100% . 

Sitosterol ester compositions derived from 
transgenic plants of the present invention, their 
progeny or their seeds are also contemplated, 
preferably wherein an esterifying fatty acid has 2 to 
2 2 carbon atoms in the main chain. 

A further aspect of the invention is cholesterol- 
lowering compositions comprising contemplated oils and 
sitosterol ester compositions. Another further aspect 
of the invention is foods, food ingredients, or food 
compositions comprising contemplated oils. 

Still further, the invention contemplates 
pharmaceutical compositions comprising a cholesterol - 
lowering effective amount of a contemplated oil, and a 
pharmaceutically acceptable carrier, excipient, or 
diluent . 

A method of lowering the plasma concentration of 
low density lipoprotein cholesterol is contemplated, 
comprising orally administering to a human or animal 
subject an effective amount of an above composition. 
Also contemplated is a method of treating or preventing 
an elevated plasma concentration of low-density 
lipoprotein cholesterol, comprising orally 
administering to a human or animal subject an effective 
amount of a contemplated composition. 

A related aspect of the invention is a method of 
making a food additive composition, comprising 
obtaining oil containing a sterol pathway product 
compound from seed of a contemplated transgenic plant 
and mixing the oil with an edible solubilizing agent, 
an effective amount of a dispersant, and optionally, an 
effective amount of an antioxidant. 
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Novel forms of two sterol pathway enzymes and the 
nucleic acids that encode them are disclosed: an 
Arahidopsis enzyme having nucleic acid similarity to a 
squalene epoxidase, and an Arabidopsis enzyme having 
nucleic acid similarity to an obtusifoliol C14a- 
demethylase enzyme. Thus, the invention contemplates 
an isolated DNA molecule having a nucleotide sequence 
of disclosure SEQ ID NO: 4, 6, 8, 10, 14, 15, 17 or the 
complements thereof. Also contemplated is a nucleotide 
sequence that hybridizes to the nucleotide sequence of 
SEQ ID NO: 4, 6, 8, 10, 14, 15, 17 or their complements 
under a wash stringency equivalent to 0 . 5X SSC to 2X 
SSC, 0.1% SDS, at 55-65°C, and that encode a 
polypeptide having squalene epoxidase or obtusifoliol 
C14a-demthylase enzymatic activity. Preferably, that 
enzymatic activity is substantially similar to that of 
a disclosed squalene epoxidase or obtusifoliol C14a- 
demethylase, respectively. By substantially smiliar is 
meant having enzymatic activity differing from that 
of the disclosed enzymes by about 3 0% or less, 
preferably by about 20% or less, and more preferably 
by about 10% or less when assayed by standard 
enzymatic assays. Also contemplated is a nucleotide 
sequence encoding the same genetic information as said 
nucleotide sequence of SEQ ID NO: 4, 6, 8, 10, 14, 15, 
17 or their complements or that hybridize as described 
above, but which is degenerate in accordance with the 
degeneracy of the genetic code. Recombinant 
constructs, vectors and transformed host cells 
comprising the novel isolated and purified nucleic acid 
sequences are also contemplated. In one embodiment, 
the vector is a plant vector and the host cell is a 
plant cell. Methods of producing the disclosed 
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squalene epoxidase or obtusifoliol C14a-demethylase 
enzymes are also contemplated comprising culturing a 
transformed host cell for a time and under conditions 
conductive to the production of the squalene epoxidase 
or obtusifoliol C14a-demethylase enzyme, and 
recovering the produced squalene epoxidase or 
obtusifoliol C14a-demethylase enzyme . 

Yet another aspect provides any of the above 
described transformed host cells, further comprising a 
recombinant construct or expression vector encoding a 
tocopheral synthesis pathway enzyme, and in particular, 
S - adenosy Imet hionine - dependent a - tocopherol 
methyltransf erase . Also included are plants, seeds and 
storage organs comprising the transformed host cells. 

Another aspect provides, a process of increasing 
the formation of steroid pathway products and 
tocopherols in a transformed host cell as compared to 
an otherwise identical non- transformed host cell 
comprising ( 1) transforming a host cell with a 
recombinant vector comprising (a) as operably linked 
components in the 5* to 3 ' direction, a promoter, a DNA 
sequence encoding a first polypeptide having 3-hydroxy- 
3 -me thylglutaryl -Coenzyme A reductase enzyme activity, 
and a transcription termination signal sequence; and 
(b) as operably linked components in the 5' to 3 ' 
direction, a promoter, a DNA sequence encoding at least 
one polypeptide having steroid pathway enzyme activity 
selected from the group consisting of squalene 
epoxidase enzyme activity, sterol methyl transferase I 
enzyme activity, sterol C4 -demethylase enzyme activity, 
obtusifoliol C14a-demethylase enzyme activity, sterol 
C5-desaturase enzyme activity, and sterol methyl 
transferase II enzyme activity, and a transcription 
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termination signal sequence; (2) transforming the host 
cell of (1) with a recombinant vector comprising as 
operably linked components, a promoter, a DNA sequence 
encoding a tocopherol synthesis pathway enzyme, and a 
transcription termination sequence; and (3) 
regenerating said transformed plant cell into said 
transgenic plant . 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects, and advantages 
of the present invention will become better understood 
with regard to the following description, appended 
claims and accompanying figures where: 

Fig. 1 is an abbreviated version of a plant 
steroid compound biosynthetic pathway that shows the 
enzymes affecting steroid compound biosynthesis and 
accumulation. These include: HMG-CoA reductase, 
squalene epoxidase, sterol methyl transferase I, sterol 
C4 -demethylase , obtusif oliol C14a-demethylase , sterol 
C5 desaturase and sterol methyl transferase II. 

Fig. 2 depicts the forms of Ara.bidopsis and rubber 
HMGRl tested in Arsthidopsis and yeast to compare 
expression, activity and sterol production. 

Fig. 3 is a map showing the structure of construct 
pMON29920. pMON29920 is a binary transformation vector 
with P-7S/E9 3' cassette and the KAN gene flanked by 
the two borders where P-7S is the promoter of alpha' 
beta conglycinin protein from soybean, E9 3' is the 3' 
end of pea rbc E9 gene and KAN is the coding sequence 
for NPTII that confers resistance to kanamycin. The 
NPTII gene is driven by the 3 5S promoter from 
cauliflower mosaic virus. Spc . Str is the coding region 
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for Tn7 adenylyltransf erase conferring resistance to 
spectinomycin and streptomycin; ori-V: the vegetative 
origin of replication; rop : coding region for repressor 
of primer; ori-322: minimum known sequence required for 
a functional origin of replication; NOS 3': the 3' 
termination end of nopal ine synthase coding region. 

Fig. 4 is a map showing the structure of construct 
pMON43800. pMON43800 is a recombinant binary vector 
for Agrobacterium-mediated transformation, carrying the 
rubber HMGRl gene cassette. The HMGRl gene is driven 
by the 7S alpha' beta conglycinin promoter from 
soybean. P-7S: 7S promoter, rubber HMGRl gene: coding 
sequence for 3 -hydroxy-3 -methylglutaryl reductase from 
Hevea brasiliensis ; E9 3': 3' end of pea rbcS E9 gene; 
P-35S: 35S promoter from cauliflower mosaic virus; KAN: 
coding region for NPTII gene conferring resistance for 
kanamycin; NOS 3': 3' termination end of nopaline 
synthase coding region: Left Border: Octapine left 
border from Octapine Ti plasmid pTiA6 ; ori-V: the 
vegetative origin of replication; rop: coding region 
for repressor of primer; Spc/Str: coding region for Tn7 
adenylyltransf erase conferring resistance to 
spectinomycinand streptomycin . 

Fig. 5 is a map showing the structure of construct 
pMON23616. pMON23616 is a plant expression vector 
containing P-NOS/ORF-7/KAN/NOS-3 ' cassette. P-NOS: NOS 
promoter from Agrobacterium tumefaciens pTiT37; ORF-7: 
a short open reading frame that attenuates expression 
of KAN in plants; KAN: coding sequence of NPTII gene 
that confers resistance to kanamycin and neomycin; ble: 
confers resistance to bleomycin; NOS 3': 3' termination 
end of nopaline synthase coding region; Left Border: 
Octapine left border from Octapine Ti plasmid pTiA6 ; 
ori-V: the vegetative origin of replication; rop: 
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coding region for repressor of primer; Spc/Str: coding 
region for Tn7 adenylyltransf erase conferring 
resistance to spectinomycin and streptomycin. 

Fig. 6 is a map showing the structure of construct 
pMON43818. pMON43818 is a recombinant binary vector 
carrying the gene encoding rubber hydroxymethyl 
glutaryl CoA reductasel (HMGRl) in sense orientation 
driven by the soybean alpha' beta conglycinin promoter. 
P-NOS: nopaline synthase gene promoter; kan: coding 
region for neomycin phospho transferase protein to 
confer resistance to kanamycin; NOS 3': 3' termination 
end of nopaline synthase coding region; Soy Alpha' Beta 
Conglycinin: 7S alpha' beta conglycinin gene promoter 
from soybean; Rubber HMGRl gene: coding sequence for 
HMGRl gene from Hevea brasiliensis ; E9 3' : 3' end of 
pea rbcS E9 gene; Left border: octopine left border, 
sequence essential for transfer of T-DNA into 
Agrobacterium; ori-V: plasmid origin of replication in 
Agrohacterium; rop : coding sequence for repressor of 
primer; Ori-322: origin of replication in E.coli; 
Spc/Str: coding region for Tn7 adenylyltransf erase 
(AAD(3")) conferring resistance to spectinomycin and 
streptomycin; Right Border: right border sequence of T- 
DNA essential for integration into Agrobacterium. 

Figure 7 is a map showing the structure of 
construct pMON43052. pMON43052 is a recombinant 
shuttle vector, carrying the cDNA fragment encoding the 
catalytic domain of Arabidopsis HMGRl in sense 
orientation driven by the soybean alpha' beta 
conglycinin promoter. P-NOS: nopaline synthase gene 
promoter; kan: coding region for neomycin 
phosphotransferase protein to confer resistance to 
kanamycin; NOS 3': 3' termination end of nopaline 
synthase coding region; Soy Alpha' Beta Conglycinin: 7S 
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alpha' beta conglycinin gene promoter from soybean; 
Arabidopsis HMGR catalytic domain: coding sequence for 
the catalytic domain of Arahidopsis HMGRl protein; E9 
3': 3' end of pea rbcS E9 gene; Left border: octopine 
left border, sequence essential for transfer of T-DNA 
into AgrojbacteriuiTj; ori-V: plasmid origin of 
replication in AgroJbacteriLim; rop : coding sequence for 
repressor of primer; Ori-322: origin of replication in 
E.coli; Spc/Str: coding region for Tn7 

adenylyltransf erase (AAD (3 " ) ) conferring resistance to 
spectinomycin and streptomycin; Right Border: right 
border sequence of T-DNA essential for integration into 
Agroba. c t eri um . 

Figure 8 is a map showing the structure of 
construct pMON51850. pMON51850 is a binary vector for 
Agrojbacteriuin mediated transformation of soybean. P- 
NOS : nopal ine synthase gene promoter; kan: coding 
region for neomycin phosphotransferase protein to 
confer resistance to kanamycin; NOS 3': 3' termination 
end of nopaline synthase coding region; Left border: 
octopine left border sequence essential for transfer of 
T-DNA into Agrobacterium; ori-V: plasmid origin of 
replication in AgrroJbacterium; rop: coding sequence for 
repressor of primer; ori-322: origin of replication in 
E.coli; Spc/Str: coding region for Tn7 

adenylyltransf erase (AAD (3")) conferring resistance to 
spectinomycin and streptomycin; Right Border: right 
border sequence of T-DNA essential for integration into 
Agrr'oJbac teri um . 

Figure 9 is a map showing the structure of 
construct pMON43 057. pMON43057 is a recombinant binary 
vector for AgroJbacteriu/n mediated transformation of 
soybean, carrying the gene cassette for expressing 
catalytic domain of HMGRl from Arabidopsis thaliana. 
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The catalytic domain of the HMGRl cDNA is driven by 
soybean 7S alpha' beta conglycinin promoter. P-NOS: 
nopal ine synthase gene promoter; kan: coding region for 
neomycin phosphotransferase protein to confer 
resistance to kanamycin; NOS 3': 3' termination end of 
nopaline synthase coding region; Left border: octopine 
left border sequence essential for transfer of T-DNA 
into Agrobacterium; ori-V: plasmid origin of 
replication in Agrobacteriumj rop : coding sequence for 
repressor of primer; ori-322: origin of replication in 
E.coli; Spc/Str: coding region for Tn7 

adenylyltransf erase (AAD(3")) conferring resistance to 
spectinomycin and streptomycin; Right Border: right 
border sequence essential for transfer of T-DNA into 
Agrojbacterium; Soy Alpha' Beta Conglycinin: soybean 7S 
alpha' beta conglycinin gene promoter; Arahidopsis 
HMGR catalytic domain: coding sequence for Arahidopsis 
HMGRl catalytic domain; E9 3' : 3' end of pea rbcS E9 
gene . 

Figure 10 is a map showing the structure of 
construct pMON43058. pMON43058 is a recombinant binary 
vector for AgroJbacteriujn-mediated soybean 
transformation, carrying gene expression cassettes for 
catalytic domain of HMGRl from Arahidopsis thaliana and 
SMTII from Arabidopsis thaliana. P-NOS: nopaline 
synthase gene promoter; kan: coding region for neomycin 
phosphotransferase protein to confer resistance to 
kanamycin; NOS 3': 3' termination end of nopaline 
synthase coding region; Left border: octopine left 
border sequence essential for transfer of T-DNA into 
Agrohacterium; ori-V: plasmid origin of replication in 
Agrohacterium; rop: coding sequence for repressor of 
primer; ori-322: origin of replication in E.coli} 
Spc/Str: coding region for Tn7 adenylyltransf erase 
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(AAD(3")) conferring resistance to spectinomycin and 
streptomycin; Right Border: right border sequence 
essential for transfer of T-DNA into Agrobacterium; Soy 
Alpha' Beta Conglycinin: 7S alpha' beta conglycinin 
gene promoter from soybean; Arabidopsis HMGR catalytic 
domain: sequence encoding the catalytic domain of 
Ar-ajbidopsis HMGRl ; E9 3': 3' end of pea rbcS E9 gene; 
Soy Alpha' Beta Conglycinin: soybean 7S alpha' beta 
conglycinin gene promoter; Arabidopsis SMT2 : cDNA 
encoding sterol methyl transferase II enzyme from 
Arabidopsis thaliana (accession no: X89867) ; NOS 3': 3' 
termination end of nopaline synthase coding region. 

Fig. 11 is profile (histogram) of the sterol 
composition of Rl transgenic soybean seeds when 
Arabidopsis truncated HMGR (catalytic domain without 
linker) was overexpressed using seed-specific 7S 
promoter (data from pMON43057 :p7S : : At HMGR truncated). 

Fig. 12 is a profile (histogram) of the sterol 
composition of Rl transgenic soybean seeds when 
Arabidopsis truncated HMGR (catalytic domain without 
linker) and Arabidopsis SMTII were overexpresed (data 
from pMON43058 :p7S : : At HMGR truncated and p7S::At 
SMTII) . The expression of the genes is controlled by 
the seed-specific 7S promoter. 

Figure 13 is a map showing the structure of 
construct pMON53733. pMON53733 is a recombinant binary 
vector carrying the cDNA encoding full-length form of 
Arabidopsis hydroxymethyl glutaryl CoA reductasel 

(HMGRl) in sense orientation driven by the enhanced 
cauliflower mosaic virus 35S promoter. P-35S: 35S 
promoter from cauliflower mosaic virus; kan: confers 
resistance to neomycin and kanamycin; NOS 3' : 3' 
termination end of nopaline synthase coding region; 
Left border: octopine left border, sequence essential 
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for transfer of T-DNA into Agrohacterium; ori-V: 
plasmid origin of replication in AgrohsLcterium; rop: 
coding sequence for repressor of primer; ori-322: 
origin of replication in E.coli; Spc/Str: coding region 
for Tn7 adenylyltransf erase (AAD(3")) conferring 
resistance to spectinomycin and streptomycin; Right 
Border: right border sequence of T-DNA essential for 
integration into Agrobacterium; P-e35S: enhanced 
cauliflower mosaic virus promoter; Arabidopsis HMGRl : 
cDNA sequence encoding full-length form of Arabidopsis 
HMGRl; E9 3' : 3' end of pea rbcS E9 gene. 

Figure 14 is a map showing the structure of 
construct pMON53734. pMON53734 is a recombinant binary 
vector carrying the cDNA encoding catalytic domain with 
linker region of Arabidopsis hydroxymethyl glutaryl CoA 
reductasel (HMGRl) in sense orientation driven by the 
enhanced cauliflower mosaic virus 35S promoter. P-35S: 
3 5S promoter from cauliflower mosaic virus; kan: 
confers resistance to neomycin and kanamycin; NOS 3': 
3' termination end of nopaline synthase coding region; 
Left border: octopine left border, sequence essential 
for transfer of T-DNA into Agrrojbacterium; ori-V: 
plasmid origin of replication in Agrobacterium; rop: 
coding sequence for repressor of primer; ori-322: 
origin of replication in E.coli; Spc/Str: coding region 
for Tn7 adenylyltransf erase {AAD(3")) conferring 
resistance to spectinomycin and streptomycin; Right 
Border: right border sequence of T-DNA essential for 
integration into AgroJbacteriun?; P-e35S: enhanced 
cauliflower mosaic virus promoter; Arabidopsis tHMGRl : 
cDNA sequence encoding catalytic domain with linker 
region of Arabidopsis HMGRl; E9 3': 3' end of pea rbcS 
E9 gene . 
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Figure 15 is a map showing the structure of 
construct pMON53735. pMON53735 is a recombinant binary 
vector carrying the cDNA encoding catalytic domain 
without the linker region of Arabidopsis hydroxymethyl 
glutaryl CoA reductasel (HMGRl) in sense orientation 
driven by the enhanced cauliflower mosaic virus 3 5S 
promoter. P-35S: 35S promoter from cauliflower mosaic 
virus; kan: confers resistance to neomycin and 
kanamycin; NOS 3': 3' termination end of nopaline 
synthase coding region; Left border: octopine left 
border, sequence essential for transfer of T-DNA into 
Agrobacterium; ori-V: plasmid origin of replication in 
Agrobacterium; rop; coding sequence for repressor of 
primer; ori-322: origin of replication in E.coli; 
Spc/Str: coding region for Tn7 adenylyltransf erase 
(AAD(3")) conferring resistance to spectinomycin and 
streptomycin; Right Border: right border sequence of T- 
DNA essential for integration into Agrobacteriuiu; P- 
e35S: enhanced cauliflower mosaic virus promoter; 
Arabidopsis cHMGRl : cDNA sequence encoding catalytic 
domain without the linker region of Arabidopsis HMGRl; 
E9 3 ' : 3 ' end of pea rbcS E9 gene . 

Figure 16 is a map showing the structure of 
construct pMON53736. pMON53736 is a recombinant binary 
vector carrying the cDNA encoding full-length form of 
rubber {Hevea brasiliensis) hydroxymethyl glutaryl CoA 
reductasel (HMGRl) in sense orientation driven by the 
enhanced cauliflower mosaic virus 35S promoter. P-35S: 
35S promoter from cauliflower mosaic virus; kan: 
confers resistance to neomycin and kanamycin; NOS 3': 
3' termination end of nopaline synthase coding region; 
Left border: octopine left border, sequence essential 
for transfer of T-DNA into Agrobacterium; ori-V: 
plasmid origin of replication in AgrobactGrium; rop: 
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coding sequence for repressor of primer; ori-322: 
origin of replication in E.coli; Spc/Str: coding region 
for Tn7 adenylyltransf erase (AAD{3")) conferring 
resistance to spectinomycin and streptomycin; Right 
Border: right border sequence of T-DNA essential for 
integration into Agrobacteriumj P-e35S: enhanced 
cauliflower mosaic virus promoter; Hevea HMGRl : cDNA 
sequence encoding full-length form of rubber HMGRl; E9 
3' : 3' end of pea rbcS E9 gene. 

Figure 17 is a map showing the structures of 
construct pMON53737. pMON53737 is a recombinant binary 
vector carrying the cDNA encoding catalytic domain with 
linker region of rubber (Hevea brasiliensis) 
hydroxymethyl glutaryl CoA reductasel (HMGRl) in sense 
orientation^driven by the enhanced cauliflower mosaic 
virus 35S promoter. P-3 5S: 3 5S promoter from 
cauliflower mosaic virus; kan: confers resistance to 
neomycin and kanamycin; NOS 3' : 3' termination end of 
nopaline synthase coding region; Left border: octopine 
left border, sequence essential for transfer of T-DNA 
into Agrobacterium; ori-V: plasmid origin of 
replication in Agrobacterium; rop : coding sequence for 
repressor of primer; ori-322: origin of replication in 
E.coli} Spc/Str: coding region for Tn7 

adenylyltransf erase (AAD(3")) conferring resistance to 
spectinomycin and streptomycin; Right Border: right 
border sequence of T-DNA essential for integration into 
Agrobacterium; P-e35S: enhanced cauliflower mosaic 
virus promoter; rubber tHMGRl : cDNA sequence encoding 
catalytic domain with linker region of rubber HMGRl; E9 
3' : 3' end of pea rbcS E9 gene. 

Figure 18 is a map showing the structure of 
construct pMON53738. pMON53738 is a recombinant binary 
vector carrying the cDNA encoding mutant form of rubber 
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(Hevea brasiliensis) hydroxymethyl glutaryl CoA 
reductasel (HMGRl) in sense orientation driven by the 
enhanced cauliflower mosaic virus 35S promoter. In the 
mutant rubber HMGRl the putative phosphorylation site, 
the serine amino acid residue at position 566 is 
changed to alanine amino acid residue (SEQ ID 23) . P- 
35S: 35S promoter from cauliflower mosaic virus; kan: 
confers resistance to neomycin and kanamycin; NOS 3': 
3' termination end of nopaline synthase coding regions- 
Left border: octopine left border, sequence essential 
for transfer of T-DNA into AgroJbacteriu/n; ori-V: 
plasmid origin of replication in Agrobacterium; rop : 
coding sequence for repressor of primer; ori-322: 
origin of replication in E.coli; Spc/Str: coding region 
for Tn7 adenylyltransf erase (AAD(3")) conferring 
resistance to spectinomycin and streptomycin; Right 
Border: right border sequence of T-DNA essential for 
integration into Agrohacterium; P-e35S: enhanced 
cauliflower mosaic virus promoter; rubber tHMGRl Ala 
566: cDNA sequence encoding catalytic domain with 
linker region of rubber HMGRl in which serine amino 
acid residue at position 566 is changed to alanine 
amino acid residue using site directed mutagenesis; E9 
3': 3' end of pea rbcS E9 gene. 

Figure 19 is a map showing the structure of 
construct pMON53739. pMON53739 is a recombinant binary 
vector carrying the cDNA encoding mutant form of rubber 
(Hevea brasiliensis) hydroxymethyl glutaryl CoA 
reductasel (HMGRl) in sense orientation driven by the 
enhanced cauliflower mosaic virus 35S promoter. In the 
mutant rubber HMGRl the putative phosphorylation site, 
the serine amino acid residue at position 567 is 
changed to alanine amino acid residue (SEQ ID 24) . P- 
35S: 35S promoter from cauliflower mosaic virus; kan: 
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confers resistance to neomycin and kanamycin; NOS 3': 
3' termination end of nopaline synthase coding region; 
Left border: octopine left border, sequence essential 
for transfer of T-DNA into Agrohacterium; ori-V: 
plasmid origin of replication in AgrroJbacteriuin; rop : 
coding sequence for repressor of primer; ori-322: 
origin of replication in E.coli; Spc/Str: coding region 
for Tn7 adenylyltransf erase (AAD(3")) conferring 
resistance to spectinomycin and streptomycin; Right 
Border: right border sequence of T-DNA essential for 
integration into AgrroJbacterium; P-e35S: enhanced 
cauliflower mosaic virus promoter; rubber tHMGRl Ala 
567: cDNA sequence encoding catalytic domain with 
linker region of rubber HMGRl in which serine amino 
acid residue at position 567 is changed to alanine 
amino acid residue using site directed mutagenesis; E9 
3' : 3' end of pea rbcS E9 gene. 

Figure 20 is a map showing the structure of 
construct pMON53740. pMON53740 is a recombinant binary 
vector carrying the cDNA encoding catalytic domain 
without linker region of rubber (Hevea hrasiliGnsis) 
hydroxymethyl glutaryl CoA reductasel (HMGRl) in sense 
orientation driven by the enhanced cauliflower mosaic 
virus 3 5S promoter. P-3 5S: 3 5S promoter from 
cauliflower mosaic virus; kan: confers resistance to 
neomycin and kanamycin; NOS 3': 3' termination end of 
nopaline synthase coding region; Left border: octopine 
left border, sequence essential for transfer of T-DNA 
into AgjroJbacteriujTi; ori-V: plasmid origin of 
replication in Agrohacterium; rop: coding sequence for 
repressor of primer; ori-322: origin of replication in 
E.coli; Spc/Str: coding region for Tn7 

adenylyltransf erase (AAD(3")) conferring resistance to 
spectinomycin and streptomycin; Right Border: right 
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border sequence of T-DNA essential for integration into 
AgroJbacterium; P-e35S: enhanced cauliflower mosaic 
virus promoter; rubber cHMGRl : cDNA sequence encoding 
catalytic domain without linker region of rubber HMGRl ; 
E9 3 ' : 3 ' end of pea rbcS E9 gene . 

Fig. 21 is a graph comparing the cycloartenol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
transformed with pMON53733 through pMON53740 compared 
to control plants. 

Fig. 22 is a graph comparing the 24 -methylene 
cycloartenol content in micrograms of steroid compound 
per gram of seeds analyzed in transgenic Arabidopsis 
plants transformed with pMON53733 through pMON53740 
compared to control plants. 

Fig. 23 is a graph comparing the obtusifoliol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
transformed with pMON53733 through pMON53740 compared 
to control plants. 

Fig. 24 is a graph comparing the campesterol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
transformed with pMON53 73 3 through pMON53 74 0 compared 

to control plants. 

Fig. 25 is a graph comparing the sitosterol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
transformed with piyiON53 73 3 through pMON53 74 0 compared 
to control plants. 

Fig. 2 6 is a graph comparing the sitostanol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
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transformed with pMON53733 through pMON53740 compared 
to control plants. 

Fig. 27 is a sterol profile (histogram) of 
transgenic Arabidopsis harboring different forms of 
5 rubber HMGR. 

Fig. 28 is a graph of the squalene, zymosterol and 
erogosterol content in micrograms of sterol per 
milligram of cell dry weight from HMGR constructs in 
yeast HMGRl knockout mutants for constructs having full 

10 length and truncated HMG CoA reductase (HMGR) 

sequences. The truncated sequences contain substantial 
portions of the catalytic region but lack the linker 
region and the transmembrane region of HMGR. These 
sequences are derived from Arahidopsis and rubber 

15 plants. 

Fig. 2 9 is a map showing the structure of 
construct pMON4 3 84 2 . pMON43842 is a yeast expression 
vector carrying cDNA encoding Arabidopsis putative 
obtusifoliol C14a-demethylase (AC002329j in sense 

20 orientation driven by the p423Gall promoter. Sc.His3: 
HIS3 region from Saccharomyces cerevisiae encoding 
imidazoleglycerol -phosphate dehydratase for hist idine 
synthesis; Ori-fl: bacteriophage fl origin of 
replication; LAC: contains partial lad coding 

25 sequence, promoter Plac, promoter Pt7, promoter Pt3, KS 
polylinker, and partial lacZ coding sequence; lacZ: 
partial coding sequence for beta-d-galactosidase or 
lacZ protein; T-Sc.Cycl: a terminator from Cycl- iso- 
1-cytochrome c from Saccharomyces cGrevisiae to 

30 terminate transcription; obtus . C14a . demethylase 
(AC002329) : cDNA encoding Arabidopsis putative 
obtusifoliol C14a-demethylase; P-Sc.Gall: a promoter 
from Gall- galactokinase of SaccharomycGs cerevisiae to 



29 MTC 6783.1 

PATENT 

direct expression with galactose induction; LacZ- 
alpha: partial coding sequence for beta-d-galactosidase 
or lacZ protein; Ori-pUC: minimum sequence required 
for a functional origin of replication, sequence 
downstream of this region is known to affect copy 
number when expressed in bacteria; AMP: contains the 
P3 promoter and the beta- lactamase coding sequence, 
conferring resistance to ampicillin, penicillin, and 
carbenicillin; Sc.2micron: 2 micron origin of 
replication . 

Fig. 3 0 is a map showing the structure of 
construct pMON43843 . pMON43843 is a yeast expression 
vector carrying cDNA encoding Arabidopsis putative 
squalene epoxidase 1 (ATA506263) in sense orientation 
driven by the p423Gall promoter. Sc.His3: HIS3 region 
from Saccharomyces cerevisiae encoding 

imidazoleglycerol -phosphate dehydratase for histidine 
synthesis; Ori-fl: bacteriophage fl origin of 
replication; LAC: contains partial lad coding 
sequence, promoter Plac, promoter Pt7, promoter Pt3 , KS 
polylinker, and partial lacZ coding sequence; lacZ : 
partial coding sequence for beta-d-galactosidase or 
lacZ protein; T-Sc.Cycl: a terminator from Cycl- iso- 
1- cytochrome c from Saccharomyces cerevisiaG to 
terminates transcription; Squalene epoxidase 1 
(ATA506263) : cDNA encoding Arabidopsis putative 
squalene epoxidase 1 (ATA506263) ; P-Sc.Gall: a 
promoter from Gall- galactokinase of Saccharomyces 
cerevisiae to direct expression with galactose 
induction; LacZ-alpha: partial coding sequence for 
beta-d-galactosidase or lacZ protein; Ori-pUC: minimum 
sequence recjuired for a functional origin of 
replication, sequence downstream of this region is 
known to affect copy number when expressed in bacteria; 
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AMP: contains the P3 promoter and the beta- lactamase 
coding sequence, conferring resistance to ampicillin, 
penicillin, and carbenicillin; Sc.2micron: 2 micron 
origin of replication. 
5 Fig. 31 is a map showing the structure of 

construct pMON43844. pMON43844 is a yeast expression 
vector carrying cDNA encoding Arahidopsis putative 
squalene epoxidase 1(ATA304243) in sense orientation 
driven by the p423Gall promoter. Sc.His3: HIS3 region 
10 from Saccharomyces cerevisiae encoding 
^ imidazoleglycerol -phosphate dehydratase for histidine 

'fl synthesis; Ori-fl: bacteriophage fl origin of 

replication; LAC: contains partial lad coding 
sequence, promoter Plac, promoter Ft 7, promoter Pt3, KS 
15 polylinker, and partial lacZ coding sequence; lacZ: 
partial coding sequence for beta-d-galactosidase or 
lacZ protein; T-Sc.Cycl: a terminator from Cycl- iso- 
fy 1 -cytochrome c from Saccharomyces cerGvisiae to 

terminate transcription; Arab, squalene epoxidase 1 
H= 20 (ATA304243) : cDNA encoding Arahidopsis putative 

squalene epoxidase 1 (ATA304243) ; P-Sc.Gall: a 
promoter from Gall- galactokinase of SaccharomycGS 
cerevisiae to direct expression with galactose 
induction; LacZ-alpha: partial coding sequence for 
25 beta-d-galactosidase or lacZ protein; Ori-pUC: minimum 
sequence required for a functional origin of 
replication, sequence downstream of this region is 
known to affect copy number when expressed in bacteria; 
AMP: contains the P3 promoter and the beta- lactamase 
30 coding sequence, conferring resistance to ampicillin, 
penicillin, and carbenicillin; Sc.2micron: 2 micron 
origin of replication. 

Fig. 32 is a comparision of known HMG CoA 
reductase amino acid sequences . ClustalW alignment of 
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forty- three non- redundant HMG-CoA reductase sequences 
to represent archaebacterial , eubacterial, fungal, 
p^nt and animal groups. The putative functional 
domains in the alignment marked as described below are 
based on the three dimnensional structure of 
PsGudomonELS mevalonii HMGR (Lawrence et al . , 1995, 
Science 268:1758): boxed -HMGCoA binding domain, light 
shade-NAD(H) binding domain, underlined consensus- 
domains involved in catalysis, * underneath consensus 
and boldface-key histidine residue involved in 
catalysis. The putative phosphorylation site residues 
are marked with t and boldface, and are located at the 
C- terminal region of the protein, adjacent to a highly 
conserved arginine, marked with t and boldface. Also 
indicated are the conserved Glu (E) , Lys (K) , and Asp 
(D) residues, marked by E, K, and D, respectively. 
These residues are thought to be critical in catalysis, 
based on the crystal structure (Tabernero et al . , 1999; 
PNAS 96 (13) : 7167-71) . 

Appendices A through C show SEQ ID Nos : 1 through 
3, respectively. Appendices D through G show SEQ ID 
Nos 2 0 thorough 23, respectively. 

DETAILED DESCRIPTION 

The following detailed description is provided to 
aid those skilled in the art in practicing the present 
invention. Even so, this detailed description should 
not be construed to unduly limit the present invention 
as modifications and variations in the embodiments 
discussed herein can be made by those of ordinary skill 
in the art without departing from the spirit or scope 
of the present inventive discovery. 

All publications, patents, patent applications, 
databases and other references cited in this 
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application are herein incorporated by reference in 
their entirety as if each individual publication, 
patent, patent application, database or other reference 
^ere specifically and individually indicated to be 
incorporated by reference. 

We have expressed the full-length forms of the 
rubber and Arabidopsis HMGRs driven by seed- specif ic 
promoters in transgenic canola and soybean. We have 
demonstrated sterol over-production up to 2-4 fold 
higher in seeds from these transgenic plants. We also 
demonstrated a higher accumulation of pathway 
intermediates in soybean than canola. These results 
were disclosed in PCT publication WO 00/61771. 
However, we have expressed a truncated form of the 
Arabidopsis hmgl without the linker and membrane 
spanning domains in Arabidopsis and soybean. The 
results in Arabidopsis were similar to that 
demonstrated by Gonzalez et al . (1997) and we compared 
the sterol profiles of our transgenic plants with those 
produced by Gonzalez et al . , using our methods to show 
they are comparable. We found the same types of pathway 
intermediates accumulating. However, in soybean seeds 
we have demonstrated the accumulation of squalene to a 
very high level (-3mg/g seed which is around 100-fold 
higher than in nontransgenic controls) . This is an 
unexpected result not disclosed or suggested in the 
prior art. Squalene is a precursor for sterols and in 
soybean it appears that there is a "^bottleneck" in the 
further conversion of this precursor to sterols. Thus, 
it appears that there could be additional ways of over- 
producing sterols in soybean to levels greater than 10- 
fold which would include combining a truncated form of 
HMGR with other genes coding for enzymes down- stream of 
squalene . 
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This opens the potential to combine other genes 
such as squalene epoxidase for further enhancing the 
levels of desirable sterols. Such a combination has 
not been disclosed or suggested in the prior art . 
5 Squalene expoxidase catalyzes the addition of oxygen to 
squalene which is a 30-carbon linear isoprenoid chain 
thus allowing for cyclization to form cycloartenol . 
Additional enzymes downstream that can be also be 
manipulated are sterol methyltransf erase 1, C-4 
10 demethylase, C-14 demethylase, sterol methytransf erase 
2, and C-5 desaturase that would all deplete other 
□ pathway intermediates shown to accumulate in soybeans. 

^ By using such strategies it is possible to convert all 

W — m 

ffl of the squalene and other intermediates to end sterols 

y E 

15 such as sitosterol, stigmasterol and campesterol , Thus, 
sterol level in soybean oils can be elevated from 0.3% 
= up to 3.5%. Expression of the full-length rubber HMGR 

in soybeans results in a sterol level increase up to 
W 2.7%. 

Q 20 Enhancement of 3 -hydroxy- 3 -methylglutaryl -CoA 

^ reductase (HMG Co-A reductase) activity in certain 

cells results in increased sterol biosynthesis. See, 
e.g. Chappell, U.S. Patent No. 5,589,619. The present 
discovery further contemplates an increase of steroid 
25 pathway end products such as A5 sterols and their 

stanol counterparts with a decreased accumulation of 
certain steroid pathway intermediates by also enhancing 
various specific steroid pathway enzyme activities, 
such that more of the steroid pathway intermediate 
30 compounds are converted to steroid pathway end 
products . 

DNA sequences encoding squalene epoxidases are 
useful for removal of squalene accumulation, genes 
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encoding sterol methyl transferase I enzymes are useful 
for removal of cylcoartenol accumulation, genes 
encoding sterol C4-demethylase are useful for removal 
of 24 -methylene cycloartenol accumulation, genes 
encoding obtusifoliol C14a-demethylases are useful for 
removal of accumulation of obtusifoliol, genes encoding 
sterol C5-desaturases are useful for removal of 
stigmasta-7-enol accumulation, and genes encoding 
sterol methyl transferase II enzymes are useful for the 
reduction of accumulated campesterol and concomitant 
increase of sitosterol . 

Levels of sitostanol and sitostanol esters can be 
elevated further by approximately 2- to 40 -fold over 
the transgenic plants of the art having only added 
genes for HMG CoA reductase by introducing additional 
genes encoding one or more of the following sterol 
pathway enzymes: a squalene epoxidase, a sterol methyl 
transferase I, a sterol C4 -demethylase , an obtusifoliol 
C14a-demethylase , a sterol C5-desaturase, a sterol 
methyl transferase II . 

As used herein, the term "structural coding 
sequence" means a DNA sequence which encodes for a 
peptide, polypeptide, or protein which may be made by a 
cell following transcription of the DNA to mRNA, 
followed by translation to the desired peptide, 
polypeptide, or protein. 

The term "sterol" as applied to plants refers to 
any chiral tetracyclic isopentenoid which may be formed 
by cyclization of squalene oxide through the transition 
state possessing stereochemistry similar to the trans- 
syn- trans -anti -trans -anti configuration, i.e., 
protosteroid cation, and which retains a polar group at 
C-3 (hydroxyl or keto) , an all- trans -anti 
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Stereochemistry in the ring system, and a side-chain 
20R-conf iguration (Parker et al . (1992) In Nes et al . , 
Eds., Regulation of Isopentenoid Metabolism, ACS 
Symposium Series No. 497, p, 110; American Chemical 
Society, Washington, D.C.). The numbering of the 
carbon atoms of a representative sterol (cholesterol) 
is shown in the following structure (FORMULA II) : 
As used herein, the term "sterol" refers to 
unsaturated hydroxyl group- containing derivatives of a 
fused, reduced ring system, cyclopenta [a] -phenanthrene , 
comprising three fused cyclohexane rings (A, B and C) 
in a phenanthrene arrangement, and a terminal 
cyclopentane ring (D) . The exemplary steroid below 
(FORMULA II) illustrates the numbering system employed 
herein in describing the location of groups and 
substituents . 




Sterols may or may not contain a C-5 to C-6 double 
bond, as this is a feature introduced late in the 
biosynthetic pathway (note Scheme 1, below) . Sterols 
contain a Cg-C^o side chain at the C-17 position, as 
shown above . 

The term "phytosterol , " which applies to sterols 
found uniquely in plants, refers to a sterol containing 
a C-5, and in some cases a C-22, double bond. 
Phytosterols are further characterized by alkylation of 
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the C-17 side-chain with a methyl or ethyl substituent 
at the C-24 position. Major phytosterols include, but 
are not limited to, sitosterol, stigmasterol , 
campesterol, brassicasterol , etc. Cholesterol, which 
lacks a C-24 methyl or ethyl side chain, is found in 
plants but is not unique thereto, and is not a 
"phytosterol " 

"Phytostanols" are saturated forms of phytosterols 
wherein the C-5 and, when present, C-22 double bond(s) 
is (are) reduced, and include, but are not limited to, 
sitostanol , campestanol , and 22 -dihydrobrassicastanol . 

"Phytosterol esters" and "phytostanol esters" are 
further characterized by the presence of a fatty acid 
or phenolic acid moiety rather than a hydroxyl group at 
the C-3 position. 

The term "steroid compounds" includes sterols, 
phytosterols, phytosterol esters, phytostanols, and 
phytostanol esters . 

The term "phytosterol compound" refers to at least 
one phytosterol, at least one phytosterol ester, or a 
mixture thereof . 

The term "phytostanol compound" refers to at least 
one phytostanol, at least one phytostanol ester, or a 
mixture thereof . 

The term "constitutive promoter" refers to a 
promoter that operates continuously in a cell, and 
which is not subject to quantitative regulation. The 
gene with which such a promoter is associated is always 
" turned on . " 

The terms "seed-specific," "fruit-specific," 
"plastid-specif ic , " etc., as they apply to promoters 
refer to preferential or exclusive activity of these 
promoters in these organs or organelles, respectively. 
"Preferential expression" refers to promoter activity 
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greater in the indicated organs or organelles than 
elsewhere in the plant. "Seed-specific" comprehends 
expression in the aleurone layer, endosperm, and/or 
embryo of the seed . 

As used herein "isolated polynucleotide" means a 
polynucleotide that is free of one or both of the 
nucleotide sequences which flank the polynucleotide in 
the naturally-occurring genome of the organism from 
which the polynucleotide is derived. The term 
includes, for example, a polynucleotide or fragment 
thereof that is incorporated into a vector or 
expression cassette; into an autonomously replicating 
plasmid or virus; into the genomic DNA of a prokaryote 
or eukaryote; or that exists as a separate molecule 
independent of other polynucleotides. It also includes 
a recombinant polynucleotide that is part of a hybrid 
polynucleotide, for example, one encoding a polypeptide 
sequence . 

As used herein "polynucleotide" and 
"oligonucleotide" are used interchangeably and refer 
to a polymeric (2 or more monomers) form of nucleotides 
of any length, either ribonucleotides or 

deoxyribonucleotides . Although nucleotides are usually 
joined by phosphodiester linkages, the term also 
includes polymeric nucleotides containing neutral amide 
backbone linkages composed of aminoethyl glycine units. 
This term refers only to the primary structure of the 
molecule. Thus, this term includes double- and single- 
stranded DNA and RNA. It also includes known types of 
modifications, for example, labels, methylation, 
"caps", substitution of one or more of the naturally 
occurring nucleotides with an analog, internucleotide 
modifications such as, for example, those with 
uncharged linkages (e.g., methyl phosphonates , 
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phosphotriesters , phosphoamidates , carbamates, etc.) , 
those ' containing pendant moieties, such as, for 
example, proteins (including for e.g., nucleases, 
toxins, antibodies, signal peptides, poly-L-lysine , 
etc.), those with intercalators (e.g., acridine, 
psoralen, etc.), those containing chelators (e.g., 
metals, radioactive metals, boron, oxidative metals, 
etc.), those containing alkylators, those with modified 
linkages (e.g., alpha anomeric nucleic acids, etc.), as 
well as unmodified forms of the polynucleotide. 
Polynucleotides include both sense and antisense 
strands . 

The alternative nucleotide sequences described 
above are considered to possess substantially similar 
enzymatic activity to that of the polypept ide-encoding 
polynucleotide sequences of the present invention if 
they encode polypeptides having enzymatic activity 
differing from that of any of the polypeptides encoded 
by the polynucleotide sequences of the present 
invention by about 3 0% or less, preferably by about 2 0% 
or less, and more preferably by about 10% or less when 
assayed by standard enzymatic assays. 

As used herein "effective amount" is intended to 
qualify the amount of an agent which will achieve the 
goal of a lessening in the severity and/or the 
frequency of incidence of a disease condition or 
disorder, over no treatment. 

The phrase "steroid pathway products" refers to 
the products of steroid biosynthesis produced by the 
action of one or more of squalene epoxidase enzyme, 
sterol methyl transferase I enzyme, sterol C4- 
demethylase enzyme, obtusifoliol C14a-demethylase 
enzyme, sterol C5-desaturase enzyme, and sterol methyl 
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transferase II enzyme. Specific examples of steroid 
pathway products include, but are not limited to, 
sitosterol, sitostanol, stigmasterol and stigmastanol . 

In the context of the present disclosure, a "non- 
transformed" plant or cell refers to a plant or cells 
which does not comprise introduced polynucleotides 
encoding a polypeptide having 3-hydroxy-3- 
methyulglutaryl- Coenzyme A reductase enzyme activity 
and at least one polypeptide having squalene epoxidase 
enzyme activity, sterol methyl transferase I enzyme 
activity, sterol C4 -demethylase enzyme activity, 
obtusifoliol C14a-demethylase enzyme activity, sterol 
C5-desaturase enzyme activity, or sterol methyl 
transferase II enzyme activity. 'Thus, a plant or cell 
that contains introduced polynucleotide sequences other 
than those above, would still be considered "non- 
transformed . " 

As used herein, "peptide" and "protein" are used 
interchangeably and mean a compound that consists of 
two or more amino acids that are linked by means of 
peptide bonds. 

I . Plant Steroid Biosynthesis 

To aid the reader in understanding the present 
invention, descriptions of the sterol compound 
biosynthetic pathway are presented below. These 
descriptions identify enzymes useful in achieving the 
modifications to the biosynthesis and accumulation of 
sterol compounds described herein, and identify sources 
of nucleic acid sequences encoding these enzymes. 

Various steps in the steroid compound biosynthetic 
pathway in plants are shown in Scheme 1, below. The 
numbers over the arrows refer to plant sterol compound 
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biosynthetic pathway enzymes and genes as indicated in 
Table 1 . 
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Table 1 

Plant Sterol Compound Pathway Enzymes and Genes 



Enzyme Step in GenBank 

Pathway Gene ID 



Acetoacetyl -CoA thiolase 
HMG-CoA synthase 
HMG-CoA reductase 



Mevalonate kinase 

Pho sphome va 1 ona te kinase 



Me va 1 ona t e py r ophos pha t e 



de c a r boxy 1 a s e 



Isopentenyl diphosphate 



isomerase 



Farne sy 1 py r opho spha t e 



synthase 



Squalene synthase 



Squalene epoxidase 



Squalene cyclase 
Sterol C-24 
methyl transferase 

Sterol C-4 demethylase 



Cycloeucalenol - 
obtusif oliol isomerase 

Sterol C-14 demethylase 



1 
2 
3 



4 
5 



8 



10 



11 
12 , 18 



13 , 19 



14 



15 



X78116 
X83882 
X15032 
L19262 
X77793 
Not 

available 
Y14325 



U4 92 5 9 
U47324 
X75789 



AF0045 
60 

AB0168 
83 

U87266 
U71400 



Not 

available 
Not 

available 



U74319 
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Sterol C-14 reductase 



Sterol C-8 isomerase 



16 



17 
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PCT WO 



97/48793 
AF0303 
57 



Sterol C-5 desaturase 
Sterol C-7 reductase 
Sterol C-24 isomerase 



Sterol C-24 reductase 



Sterol C-22 desaturase 



20 
21 
22 



23 



24 



Sterol C-5 reductase 



25 



X90454 



U49398 
Klahre 
et al . 
(1998) 
Plant Cell 
10: 1677- 
1690 

Same 
as 22 
Not 

available 
WO 



00/61771 



The plant sterol compound biosynthesis pathway has 
two distinct components. The early pathway reactions, 
leading from acetyl -CoA to squalene via mevalonic acid, 
are common to other isoprenoids. The later pathway 
reactions, leading from squalene to the major plant 
sterol compounds such as sitosterol, campesterol and 
stigmasterol , are committed biosynthetic reactions. 

The early pathway reactions have been studied in 
fungi and plants (Lees et al . , Biochemistry and 
Function of Sterols, Nes and Parish, Eds., CRC Press, 
85-99 (1997); Newman and Chappell, Biochemistry and 
Function of Sterols, Nes and Parish, Eds., CRC Press, 
123-134 (1997); Bach et al . , Biochemistry and Function 
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of Sterols, Nes and Parish, Eds., CRC Press, 135-150 
(1997) ) . 

Acetoacetyl CoA thiolase (EC 2.3.1.9) catalyzes 
the first reported reaction, which consists of the 
formation of acetoacetyl CoA from two molecules of 
acetyl CoA (Dixon et al . , J. Steroid Biochem. Mol . 
Biol. 62: 165-171 (1997)). This enzyme has been 
purified from radish. A radish cDNA has been isolated 
by functional complementation in Saccharomyces 
cerevisiae (GeneBank Accession # X78116) . A radish 
cDNA has also been screened against a cDNA library of 
Arahidopsis tha^liana (Vollack and Bach, Plant 
Physiology nil 1097-1107 (1996)). 

HMGCoA synthase (EC 4.1.3.5) catalyzes the 
production of HMGCoA. This reaction condenses acetyl 
CoA with acetoacetyl CoA to yield HMGCoA. HMGCoA 
synthase has been purified from yeast. A plant HMGCoA 
synthase cDNA has also been isolated from Arahidopsis 
thaliana (Montamat et al . , Gene 167: 197-201 (1995)). 

HMGCoA reductase, also referred to as 3 -hydroxy- 3- 
methyglutaryl- coenzyme A (EC 1.1.1.34), catalyzes the 
reductive conversion of HMGCoA to mevalonic acid (MVA) . 
This reaction is reported to play a role in controlling 
plant isoprenoid biosynthesis (Gray, Adv. Bot. Res. 14: 
25-91 (1987); Bach et al . , Lipids 26: 637-648 (1991); 
Stermer et al . , J". Lipid Res. 35: 1133-1140 (1994). 
Plant HMGCoA reductase genes are often encoded by 
multigene families. The number of genes comprising 
each multigene family varies, depending on the species, 
ranging from two in Arabidopsis thaliana to at least 
seven in potato. Overexpression of plant HMGCoA 
reductase genes in transgenic tobacco plants has been 
reported to result in the overproduction of 
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phytosterols (Schaller et al . , Plant Physiol. 109: 761- 
770 (1995) ) . 

Mevalonate kinase (EC 2.7.1.36) catalyzes the 
phosphorylation of mevalonate to produce mevalonate 5- 
5 phosphate. It has been reported that mevalonate kinase 
plays a role in the control of isoprenoid biosynthesis 
(Lalitha et al . , Indian. J, Biochem. Biophys . 23: 249- 
253 (1986) ) . A mevalonate kinase gene from Arabidopsis 
thaliana has been cloned (GeneBank accession number 
10 X77793; Riou et al . , Gene 148: 293-297 (1994)). 

Phosphomevalonate kinase (EC 2.7,4.2) (MVAP 
^ kinase) is an enzyme associated with isoprene and 

^ ergosterol biosynthesis that converts mevalonate- 5- 

yi phosphate to mevalonate -5 -pyrophosphate utilizing ATP 

J 15 (Tsay et al . , Mbl . Cell. Biol. 11: 620-631 (1991)). 

W Mevalonate pyrophosphate decarboxylase ("MVAPP 

p decarboxylase") (EC 4.1.1.33) catalyzes the conversion 

2; of mevalonate pyrophosphate to isopentenyl diphosphate 

p ("IPP"). The reaction is reported to be a 

1 '5 

2 20 decarboxylat ion/dehydration reaction which hydrolyzes 

E 

ATP and requires Mg2+. A cDNA encoding Arahidopsis 
thaliana MVAPP decarboxylase has been isolated (Toth et 
al . , lJ. Biol. Chem. 271: 7895-7898 (1996)). An 
isolated Arabidopsis thaliana MVAPP decarboxylase gene 

25 was reported to be able to complement the yeast MVAPP 
decarboxylase . 

Isopentenyl diphosphate isomerase ( " IPP : DMAPP" ) 
(EC 5.3.3.2) catalyzes the formation of dimethylallyl 
pyrophosphate (DMAPP) from isopentenyl pyrophosphate 

30 (IPP) . Plant IPP: DMAPP isomerase gene sequences have 
been reported for this enzyme. It has also been 
reported that IPP: DMAPP isomerase is involved in rubber 
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biosynthesis in a latex extract from Hevea (Tangpakdee 
et al., Phytochemistjcy ^5: 261-267 (1997). 

Farnesyl pyrophosphate synthase (EC 2.5.1.1) is a 
prenyltransf erase which has been reported to play a 
role in providing polyisoprenoids for sterol compound 
biosynthesis as well as a number of other pathways (Li 
et al.. Gene 17: 193-196 (1996)). Farnesyl 
pyrophosphate synthase combines DMAPP with IPP to yield 
geranyl pyrophosphate ("GPP"). The same enzyme 
condenses GPP with a second molecule of IPP to produce 
farnesyl pyrophosphate ("FPP"). FPP is a molecule that 
can proceed down the pathway to sterol compound 
synthesis, or that can be shuttled through other 
pathways leading to the synthesis of quinones or 
sesquiterpenes . 

Squalene synthase' (EC 2.5.1.21) reductively 

condenses two molecules of FPP in the presence of Mg2+ 
and NADPH to form squalene.* The reaction involves a 
head-to-head condensation, and forms a stable 
intermediate, presqualene diphosphate. The enzyme is 
subject to sterol demand regulation similar to that of 
HMGCoA reductase. The activity of squalene synthase 
has been reported to have a regulatory effect on the 
incorporation of FPP into sterol and other compounds 
for which it serves as a precursor (Devarenne et al . , 
Arch, Biochem. Biophys . 349: 205-215 (1998)). 

Squalene epoxidase (EC 1.14.99.7) (also called 
squalene monooxygenase) catalyzes the conversion of 
squalene to squalene epoxide (2 , 3 -oxidosqualene) , a 
precursor to the initial sterol molecule in the sterol 
compound biosynthetic pathway, cycloartenol . This is 
the first reported step in the pathway where oxygen is 
required for activity. The formation of squalene 
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epoxide is also the last common reported step in sterol 
biosynthesis of animals, fungi, and plants. 

The later pathway of sterol compound biosynthetic 
steps starts with the cyclization of squalene epoxide 
and ends with the formation of 5 -24-alkyl sterols in 
plants . 

2 , 3 -oxidosqualene cycloartenol cyclase (EC 
5.4.99.8) (also called cycloartenol synthase) is the 
first step in the sterol compound pathway that is 
plant-specific. The cyclization of 2 , 3 -oxidosqualene 
leads to lanosterol in animals and fungi, while in 
plants the product is cycloartenol. Cycloartenol 
contains a 9 , 19-cyclopropyl ring. The cyclization is 
reported to proceed from the epoxy end in a chair-boat - 
chair-boat sequence that is mediated by a transient C- 
20 carbocationic intermediate. 

S-adenosyl-L-methionine : sterol C-24 methyl 
transferase ("SMTl") (EC 2.1.1.41) catalyzes the 
transfer of a methyl group from S-adenosyl -L-methionine 
to the C-24 center of the sterol side chain (Nes et al . 

(1991) J". Biol. Chem. 266 (23 ): 15202 - 152 12 ) . This is 
the first of two methyl transfer reactions that have 
been reported to be an obligatory and rate-limiting 
step of the sterol compound-producing pathway in 
plants. The second methyl transfer reaction occurs 
later in the pathway after the A^"'' isomerase. The 
enzyme responsible for the second methyl transfer 
reaction is named SMTII (Bouvier-Nave , P. et al . , 

(1997) Eur. J. Biochem, , 246: 518-529). An isoform, 
SMTII, catalyzes the conversion of cycloartenol to a 
^23 (24) _24_^lj^yl sterol, cyclosadol (Guo et al . (1996) 
Tetrahed. Lett. 37 (38) : 6823-6826) . 
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Sterol C-4 demethylase catalyzes the first of 
several demethylation reactions, which results in the 
removal of the two methyl groups at C-4. While in 
animals and fungi the removal of the two C-4 methyl 
5 groups occurs consecutively, in plants it has been 

reported that there are other steps between the first 
and second C-4 demethylations . The C-4 demethylation 
is catalyzed by a complex of microsomal enzymes 
consisting of a monooxygenase, an NAD"*" -dependent sterol 
10 4 -decarboxylase, and an NADPH- dependent 3 -ketosteroid 
reductase . 

Cycloeucalenol -obtusif oliol isomerase ( "COI " ) 
catalyzes the opening of the cyclopropyl ring at C-9. 
The opening of the cyclopropyl ring at C-9 creates a 

15 double bond at C-8. 

Sterol C-14 demethylase catalyzes demethylation at 
C-14, which removes the methyl group at C-14 and 
creates a double bond at that position. In both fungi 
and animals, this is the first step in the sterol 

20 synthesis pathway. Sterol 14 -demethylation is mediated 
by a cytochrome P-4 50 complex. 

Sterol C-14 reductase catalyzes a C-14 
demethylation that results in the formation of a double 
bond at C-14 (Ellis et al . , Gen, Microbiol. 137: 2627- 

25 2630 (1991)). This double bond is removed by a A^^ 

reductase. The normal substrate is 4 -methyl -8 , 14 , 24 
(24^) -trien-3(i-ol . NADPH is the normal reductant . 

Sterol C-8 isomerase catalyzes a reaction that 
involves further modification of the tetracyclic rings 

30 or the side chain (Duratti et al . , Biochem, Pharmacol. 
34: 2765-2777 (1985)). The kinetics of the sterol 
isomerase-catalyzed reaction favor a A® 6 A"^ isomerase 
reaction that produces a A'' group. 
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Sterol C-5 desaturase catalyzes the insertion of 
the A^-double bond that normally occurs at the A''- sterol 
level, thereby forming a A^'''-sterol (Parks et al . , 
Lipids 30: 227-230 (1995)). The reaction has been 
reported to involve the stereospecif ic removal of the 
5a and 6a hydrogen atoms, biosynthetically derived from 
the 4 pro-R and 5 pro-S hydrogens of the (+) and (-) R- 
mevalonic acid, respectively. The reaction is 
obligatorily aerobic, and requires NADPH or NADH. The 
desaturase has been reported to be a multienzyme 
complex present in microsomes. It consists of the 
desaturase itself, cytochrome b5 ^ and a pyridine 

nucleotide-dependent f lavoprotein . The A^-desaturase is 
reported to be a mono -oxygenase that utilizes electrons 
derived from a reduced pyridine nucleotide via 
cytochrome b5 . 

Sterol C-7 reductase catalyzes the reduction of a 
A'^-double bond in A^'''-sterols to generate the 
corresponding A^-sterol. It has been reported that the 
mechanism involves, like many other sterol enzymes, the 
formation of a carbocat ionic intermediate via 
electrophilic "attack" by a proton. 

Sterol C-24(28) isomerase catalyzes the reduction 

of a A^'^*^®*-A^\ a conversion that modifies the side 
chain. The product is a A^^*^^* -24 -alkyl sterol. Sterol 
C-24 reductase catalyzes the reduction of the 
double bond at C-24, which produces sitosterol. 
Recently, Klahre et al . ((1998) Plant Cell 10:1677- 
1690) discovered that both the isomerization and 
reduction steps are catalyzed by an enzyme coded by the 
same gene , i.e., DIM/DWFl . 
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Sterol C-22 desaturase (EC 2.7.3.9) catalyzes the 
formation of a double bond at C-22 on the side chain. 
This formation of a double bond at C-22 on the side 
chain marks the end of the sterol compound biosynthetic 
pathway, and results in the formation of stigmasterol 
(Benveniste (1986) Annu. Rev. Plant Physiol. 37:275- 
308) . The C-22 desaturase in yeast, which is the 
reported final step in the biosynthesis of ergosterol 
in that organism, requires NADPH and molecular oxygen. 
In addition, the reaction is also reported to involve a 
cytochrome P450 that is distinct from a cytochrome P450 
participating in demethylation reactions (Lees et al . 
(1995) Lipids 30: 221-226). 

Phytosterols are biogenetic precursors of 
brassinosteroids , steroid alkaloids, steroid 
sapogenins, ecdysteroids , and steroid hormones. This 
precursor role of phytosterols is often described as a 
"metabolic" function. A common transformation of free 
sterols in tissues of vascular plants is the 
conjugation at the 3 -hydroxy group of sterols with 
long-chain fatty acids to form steryl esters, or with a 
sugar, usually with a single molecule of P-D-glucose, 
to form steryl glycosides. Some of the steryl 
glycosides are additionally esterified, at the 6- 
hydroxy group of the sugar moiety, with long -chain 
fatty acids to form acylated steryl glycosides. 

The existence of several enzymes that are 
specifically associated with the synthesis and 
breakdown of conjugated sterols has been reported 
(Wojciechowski, Physiology and Biochemistry of .Sterols, 
eds. Patterson, Nes, AOCS Press, 361 (1991)). Enzymes 
involved in this process include: UDPGlc : Sterol 
glucosyltransf erase , phospho (galacto) glyceride steryl 
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glucoside acyltransf erase , and sterylglycoside and 
sterylester hydrolases . 

UDPGlc : sterol glucosyltransf erase (EC 2.4.1.173) 
catalyzes glucosylation of phytosterols by glucose 
transfer from UDP-glucose ("UDPGl") . The formation of 
steryl glycosides can be measured using UDP- [^^C] glucose 
as the substrate. Despite certain differences in their 
specificity patterns, all reported UDPGlc : sterol 
glucosyltransf erases preferentially glucosylate only 
sterols or sterol-like molecules that contain a C-3 
hydroxy group, a P-conf iguration, and which exhibit a 
planar ring. It has been reported that UDPGlc : sterol 
glucosyltransf erases are localized in the microsomes. 

Phospho (galacto) glyceride steryl glucoside 
acyltransf erase catalyzes the formation of acylated 
steryl glycosides from the substrate steryl glycoside 
by transfer of acyl groups from some membranous polar 
acyllipids to steryl glycoside molecules. 

Acylglycerol : sterol acyltransf erase (EC 2.3.1.26) 
catalyzes the reaction wherein certain acylglycerols 
act as acyl donors in a phytosterol esterif ication . In 
plants, the activity of acylglycerol : sterol 
acyltransf erase is reported to be associated with 
membranous fractions. A pronounced specificity for 
shorter chain unsaturated fatty acids was reported for 
all acyltransf erase preparations studied in plants. 
For example, acylglycerol : sterol acyltransf erases from 
spinach leaves and mustard roots can esterify a number 
of phytosterols. 

Sterylglycoside and sterylester hydrolases ("SG- 
hydrolases") catalyze the enzymatic hydrolysis of 
sterylglycosides to form free sterols. The SG- 
hydrolase activity is not found in mature, ungerminated 
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seeds, is reported to emerge only after the third day 
of germination, and is found mainly in the cotyledons. 
It has been reported that phospho (galacto) glyceride : SG 
acyltranaf erase may catalyze a reversible reaction. 
Enzymatic hydrolysis of sterylesters in germinating 
seeds of mustard, barley and corn is reported to be low 
in dormant seeds, but increases during the first ten 
days of germination. This activity is consistent with a 
decrease in sterylesters and an increase in free 
sterols over the same temporal period. 

II. Processes for Modifying Steroid Compound 
Biosynthesis and Accumulation 

In order to obtain seed producing oil containing 
elevated levels of phytostanols and phytostanol esters 
such as sitostanol and sitostanol esters, these 
recombinant constructs or expression cassettes can be 
introduced into plant cells by any number of 
conventional means known in the art and regenerated 
into fertile transgenic plants. The genome of such 
plants can then comprise introduced DNA encoding 
various steroid pathway enzymes, alone or in 
combination, that achieves the desirable effect of 
enhancing the levels of phytostanols, phytostanol 
esters, mixtures thereof in the oil of seed thereof. 

Preferably, the genome can comprise introduced DNA 
encoding a HMG CoA reductase enzyme and an introduced 
DNA encoding one or more of a squalene epoxidase, a 
sterol methyl transferase I, a sterol C4 -demethylase , 
an obtusifoliol C14a-demethylase , a sterol C5- 
desaturase, a sterol methyl transferase II. In each 
case, the foregoing introduced DNAs can be operatively 
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linked to regulatory signals that cause seed- specif ic 
expression thereof . 

The present invention encompasses not only such 
transgenic plants, but also transformed plant cells, 
including cells and seed of such plants, as well as 
progeny of such plants, for example produced from the 
seed. Transformed plant cells and cells of the 
transgenic plants encompassed herein can be grown in 
culture for a time and under appropriate conditions to 
produce oil containing elevated levels of phytosterols 
and/or phytostanols and their corresponding esters . 
Alternatively, the phytosterols, phytostanols, and 
their corresponding esters can be isolated directly 
from the cultures. 

In addition, of course, seed obtained from the 
transgenic, progeny, hybrid, etc., plants disclosed 
herein can be used in methods for obtaining oil 
containing phytosterols, phytosterol esters, 
phytostanols, phytostanol esters, or mixtures thereof 
employing extraction and processing procedures known in 
the art. Note, in this regard, Kochhar (1983) Prog. 
Lipid Res. 22: 161-188. 

The present invention also encompasses a method of 
producing a plant that accumulates an elevated level of 
sitosterol, at least one sitosterol ester, sitostanol, 
at least one sitostanol ester, or mixtures thereof, in 
seeds thereof compared to seeds of a corresponding 
plant comprising no introduced DNA encoding a 
polypeptide or protein that affects the biosynthesis of 
sterols, phytosterols, phytosterol esters, 
phytostanols, phytostanol esters, or combinations 
thereof, comprising sexually crossing a transgenic 
plant of the present invention with such a 
corresponding plant . The latter can be a non- 
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transgenic plant, or a transgenic plant containing 
introduced DNA encoding a trait other than one 
affecting sterol, phytosterol, etc., biosynthesis. For 
example, such trait may be insect or herbicide 
resistance. Plants produced by this method also form 
part of the present invention. 

Also included are plants that accumulate an 
elevated level of sitosterol, at least one sitosterol 
ester, sitostanol, at least one sitostanol ester, or 
mixtures thereof, in seeds thereof compared to seeds of 
a corresponding plant comprising no introduced DNA 
encoding a polypeptide or protein that affects the 
biosynthesis of sterols, phytosterols , phytosterol 
esters, phytostanols , phytostanol esters, or 
combinations thereof, which are apomictic. 

A process of increasing the formation of steroid 
pathway products in a transformed host cell as compared 
to an otherwise identical non- transformed host cell 
comprising the following steps. A host cell is 
transformed with a recombinant vector comprising (a) as 
operably linked components in the 5* to 3 • direction, a 
promoter, a DNA sequence encoding a polypeptide 
exhibiting 3 -hydroxy- 3 -met hylglutaryl -Coenzyme A 
reductase enzyme activity, and a transcription 
termination signal sequence; and (b) as operably linked 
components in the 5' to 3 * direction, a promoter, a DNA 
sequence encoding a steroid pathway enzyme, and a 
transcription termination signal sequence. The steroid 
pathway enzyme is a squalene epoxidase enzyme, a sterol 
methyl transferase I enzyme, a sterol C4 -demethylase 
enzyme, a obtusifoliol C14a-demethylase enzyme, a 
sterol C5-desaturase enzyme, and a sterol methyl 
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transferase II enzyme. The transformed plant cell is 
regenerated into a transgenic plant. 

A plant contemplated by this invention is a 
vascular, multicellular higher plant. Such higher 
5 plants will hereinafter by usually referred to simply 
as "plants". Such "plants" include both complete 
entities having leaves, stems, seeds, roots and the 
like as well as callus and cell cultures that are 
monocotyledonous and dicotyledonous. Dicotyledonous 
10 plants are a preferred embodiment of the present 
invention. 

Preferred plants are members of the Solanaceae, 
Leguminosae, Ammiaceae, Brassicaceae , Gramineae, 
Carduaceae and Malvaceae families. Exemplary plant 

15 members of those families are tobacco, petunia and 

tomato (Solanaceae) , soybean and alfalfa (Leguminosae) , 
carrot (Ammiaceae) , corn, maize and barley (Gramineae) , 
Arahidopsis (Brassicaceae) , guayule (Carduaceae) , and 
cotton (Malvaceae) . A preferred plant is tobacco of 

20 the strain Nicotiana tabacuin {N. Tahacum) , cotton of 
the strain Coker line 312-5A, soybean of the strain 
Glycine max, alfalfa of the strain RYSI or tomato of 
the strain Lycopersicon esculentium. Other plants 
include canola, maize and rape. 

25 A transgenic plant contemplated by this invention 

is produced by transforming a plant cell or protoplast 
with an added, exogenous structural gene that encodes a 
polypeptide having HMG-CoA reductase activity and an 
exogenous structural gene that encodes at least one 

30 polypeptide have steroid pathway enzyme activity to 
produce a transformed plant cell, and regenerating a 
transgenic plant form the transformed plant cell. The 
encoded polypeptide is expressed both in the 
transformed plant cell or protoplast and the resulting 
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transgenic plant. (The phrase "plant cell" will 
hereinafter be used to include a plant protoplast, 
except where plant protoplasts are specifically 
discussed) . 

A non- transgenic plant that serves as the source 
of the plant cell that is transformed, i.e. the 
precursor cell, is referred to herein as a "native, 
non- transgenic" plant. The native, non- transgenic 
plant is of the same strain as the formed transgenic 
plant . 

Sterol production in a transgenic plant of the 
present invention is increased by increasing the 
activity of the enzyme HMG-CoA reductase, which enzyme 
catalyzes the conversion of 3 -hydroxy-3 -methylglutaryl 
Coenzyme A (HMG-CoA) to mevalonate and the activity of 
at least one other steroid pathway enzyme. As used 
herein, the term "specific activity" means the activity 
normalized to cellular protein content. 

HMG-CoA reductase activity is increased by 
increasing the amount (copy number) of a gene encoding 
a polypeptide having HMG-CoA reductase catalytic 
activity. Expression of the increased amount of that 
encoded structural gene enhances the activity of that 
enzyme . 

The amount of the expressed gene is increased by 
transforming a plant cell with a recombinant DNA 
molecule comprising a vector operatively linked to a 
DNA segment that encodes a polypeptide having HMG-CoA 
reductase activitiy, and a promoter suitable for 
driving the expression of that polypeptide in that 
plant cell, and culturing the transformed plant cell 
into a transgenic plant. Such a polypeptide includes 
intact as well as a catalytically active, truncated 
HMG-CoA reductase proteins. 
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Thus, a transformed plant cell and a transgenic 
plant have one or more added, exogenous genes that 
encode a polypeptide having HMG-CoA reductase activity 
and at least one other steroid pathway enzyme activty 
5 relative to a native, non- transgenic plant or 

untransf ormed plant cell of the same type. As such, a 
transformed plant cell or transgenic plant can be 
distinguished from an untransf ormed plant cell or 
native, nontransgenic plant by standard technology such 

10 as agarose separation of DNA fragments or mRNAs 

followed by transfer and appropriate blotting with DNA 
or RNA, e.g.. Southern or Northern blotting, or by use 
of polymerase chain reaction technology, as are well 
known. Relative HMG-CoA reductase activity of the 

15 transformed cell or transgenic plant with untransf ormed 
cells and native, non- transgenic plants or cell 
cultures therefrom can also be compared, with a 
relative activity for that enzyme of about 1.5:1 for 
transgenic (transformed) to native (untransf ormed) 

20 showing transformation. Higher relative activity 
ratios such as about 15:1 have also been observed. 

Sterol accumulation can also be used to 
distinguish between native, non- transgenic and 
transgenic plants. A transgenic plant has at least 

25 about twice the total sterol content as a native, non- 
transgenic plant where a single added gene is present. 
Greater differences up to about forty- fold have also 
been observed. 

Sitostanol, sitostanol ester, and tocopherol 

30 biosynthesis and accumulation in plants can be modified 
in accordance with the present invention by variously 
expressing the nucleic acid coding sequences discussed 
above, alone or in combination, as described herein. 
The expression of sequences encoding sterol 



58 MTC 6783.1 

PATENT 

methyltransf erase II enzymes facilitates the production 
of plants in which the biosynthesis and accumulation of 
campesterol, campestanol, and their esters can be 
reduced as these enzymes shunt sterol intermediates 
away from campesterol, and toward sitosterol and 
sitostanol . 

Ill . DNA Encoding Useful Polypeptides 
The present invention contemplates a recombinant 
construct or a recombinant vector that contains a DNA 
sequence encoding a polypeptide exhibiting 3-hydroxy-3- 
methylglutaryl -Coenzyme A (HMG-CoA) reductase activity 
and a DNA sequence encoding a polypeptide exhibiting 
the activity of a steroid pathway enzyme. Each 
polypept ide-encoding DNA sequence is operably linked in 
the 5 ' to 3 ' direction independent of the other 
sequence. Each DNA sequence in the 5' to 3 ' direction 
comprises a promoter, then the DNA sequence encoding 
the polypeptide then a transcription termination signal 
sequence. The steroid pathway enzyme is a squalene 
epoxidase enzyme, a sterol methyl transferase I enzyme, 
a sterol C4 -demethylase enzyme, a obtusifoliol C14a- 
demethylase enzyme, a sterol C5-desaturase enzyme, or a 
sterol methyl transferase II enzyme. It is 
contemplated that HMG-CoA reductase and steroid pathway 
enzyme activities come from a mutant or truncated form 
of those enzymes, such as a truncated HMG-CoA reductase 
lacking the transmembrane region while retaining a 
functional catalytic domain. Several HMG CoA reductase 
sequences are known in the art. An amino acid 
alignment for these is shown in FIG. 32. The sources 
of the sequences used in building the multiple 
alignment are listed in Table 5. 
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Table 5. Sources of Sequences Used In Building 
The Multiple Alignment 
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swissprot:hmdh_schma 

swissprot:hmdh_arcfu 

gp_bct1 :psehmgcoa_1 
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026662 methanobacterium thenmoautotrophicur 
Q581 16 methanococcus jannaschii. 3-hydroxy-3 
Q59468 halobacterium volcanii (haloferax volcar 
008424 sulfolobus solfataricus. 3-hydroxy-3-me 
M22255 Saccharomyces cerevisiae Yeast HMG 
M22002 Saccharomyces cerevisiae Yeast HMG 
Q12649 phycomyces blakesleeanus. 3-hydroxy- 
Q12577 fusarium moniliforme (gibberella fujikurc 
AB012603 Candida utilis Candida utilis HMG mF 
P34136 dictyostelium discoideum (slime mold). : 
hydroxymethylglutaryl-CoA reductase (NADPH) 
P48019 oryza sativa (rice). 3-hydroxy-3-methylg 
024594 zea mays (maize). 3-hydroxy-3-methylg 
hydroxymethylglutaryl-CoA reductase (NADPH) 
hydroxymethylglutaryl-CoA reductase (NADPH) 
from proprietary soy sequence database 
Q00583 hevea brasiliensis (para rubber tree). 3- 
Q03163 catharanthus roseus (rosy periwinkle) (i 
P48022 lycopersicon esculentum (tomato). 3-hy( 
Q01559 nicotiana sylvestris (wood tobacco). 3-h 
L01400 Solanum tuberosum Potato hydroxymed 
Q43826 raphanus sativus (radish), hydroxymeth 
L19261 Arabidopsis thaliana Arabldopsis thalian 
AB021862 Cucumis melo Cucumls melo mRNA 
P29058 hevea brasiliensis (para rubber tree). 3-1 
P29057 hevea brasiliensis (para rubber tree). 3-1 
P48021 camptotheca acuminata. 3-hydroxy-3-m 
P43256 arabidopsis thaliana (mouse-ear cress). 
P00347 cricetulus griseus (Chinese hamster), 3-1 
L00183 Cricetulus sp. Hamster 3-hydroxy-3-met 
M12705 Mesocricetus auratus Syrian hamster 3 
P51639 rattus norvegicus (rat). 3-hydroxy-3-met 
Q29512 oryctolagus cuniculus (rabbit). 3-hydrox 
M1 1058 Homo sapiens Human 3-hydroxy-3-met 
M62766 Mus musculus Mouse HMG-CoA reduc 
P2071 5 xenopus laevis (african clawed frog). 3-1 
P16393 strongylocentrotus purpuratus (purple S( 
P54960 blattella germanica (german cockroach) 
PI 4773 drosophila melanogaster (fruit fly). 3-hyc 
P34135 dictyostelium discoideum (slime mold). ' 
P16237 schistosoma mansoni (blood fluke). 3-h) 
028538 archaeoglobus fulgidus. 3-hydroxy-3-m< 
M24015 Pseudomonas mevalonii P.mevalonii HI 



These sequences, and their truncated counterparts, 
are useful in the present invention. Examples of such 
preferred HMG CoA reductases include the truncated 
rubber and Arabidopsis HMG CoA reductases disclosed 
herein . 

Other enzyme -encoding DNAs can be introduced into 
plants to elevate even further the levels of desirable 
A5 sterols and their reduced stanol counterparts as 
well as other phytosterols and tocopherols. Thus, the 
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DNA sequences contemplated for use in the present 
invention, which can be used alone or in various 
combinations as discussed below, include, but are not 
limited to, those encoding the following enzymes: 3- 
hydroxysteroid oxidases; steroid Sreductases; sterol 
methyltransf erases; sterol acyltransf erases ; and S- 
adenosylmethionine-dependent a- tocopherol 

methyltransf erases . 

In each case, the sequences encoding these enzymes 
can comprise an expression cassette comprising, 
operably linked in the 5' to 3 'direction, a seed- 
specific promoter, the enzyme coding sequence, and a 
transcriptional termination signal sequence functional 
in a plant cell such that the enzyme is successfully 
expressed. For use in the methods disclosed herein, 
the recombinant constructs or expression cassettes can 
be incorporated in a vector, for example a plant 
expression vector. Such vectors can be transformed 
into host cells such as bacterial cells, for example 
during the preparation or modification of the 
recombinant constructs, and plant cells. Thus, the 
invention encompasses plants and seeds comprising such 
transformed plant cells. 

It will be apparent to those of skill in the art 
that the nucleic acid sequences set forth herein, 
either explicitly, as in the case of the sequences set 
forth above, or implicitly with respect to nucleic acid 
sequences generally known and not present herein, can 
be modified due to the built-in redundancy of the 
genetic code and noncritical areas of the polypeptide 
that are subject to modification and alteration. In 
this regard, the present invention contemplates allelic 



61 MTC 6783.1 

PATENT 

variants of structural genes encoding a polypeptide 
having HMG-CoA reductase activity. 

The previously described DNA segments are noted as 
having a minimal length, as well as total overall 
length. That minimal length defines the length of a 
DNA segment having a sequence that encodes a particular 
polypeptide having HMG-CoA reductase activity. As is 
well known in the art, as long as the required DNA 
sequence is present (including start and stop signals) , 
additional base pairs can be present at either end of 
the segment and that segment can still be utilized to 
express the protein. This, of course, presumes the 
absence in the segment of an operatively linked DNA 
sequence that represses expression, expresses a further 
product that consumes the enzyme desired to be 
expressed, expresses a product other than the desired 
enzyme or otherwise interferes with the structural gene 
of the DNA segment . 

Thus, as long as the DNA segment is free of such 
interfering DNA sequences, a DNA segment of the 
invention can be up to 15,000 base pairs in length. 
The maximum size of a recombinant DNA molecule, 
particularly a plant integrating vector, is governed 
mostly by convenience and the vector size that can be 
accommodated by a host cell, once all of the minimal 
DNA sequences required for replication and expression, 
when desired, are present. Minimal vector sizes are 
well known. 

Also encompassed by the present invention are 
nucleotide sequences biologically functionally 
equivalent to those disclosed herein, that encode 
conservative amino acid changes within the amino acid 
sequences of the presently disclosed enzymes, producing 
"silent" changes therein. Such nucleotide sequences 
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contain corresponding base substitutions based upon the 
genetic code compared to the nucleotide sequences 
encoding the presently disclosed enzymes. Substitutes 
for an amino acid within the enzyme sequences disclosed 
herein is selected from other members of the class to 
which the naturally occurring amino acid belongs. 
Amino acids can be divided into the following four 
groups: (1) acidic amino acids; (2) basic amino acids; 

(3) neutral polar amino acids; and (4) neutral non- 
polar amino acids. Representative amino acids within 
these various groups include, but are not limited to: 

(1) acidic (negatively charged) amino acids such as 
aspartic acid and glutamic acid; (2) basic (positively 
charged) amino acids such as arginine, histidine, and 
lysine; (3) neutral polar amino acids such as glycine, 
serine, threonine, cysteine, cystine, tyrosine, 
asparagine, and glutamine; and (4) neutral nonpolar 

(hydrophobic) amino acids such as alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan, 
and methionine . 

A. HMG-CoA Reductase 

The introduction of an HMG CoA reductase gene into 
a cell results in a higher carbon throughput through 
the steroid synthesis pathway. The introduction of a 
truncated HMG CoA reductase gene (lacking the 
transmembrane region, resulting in a soluble HMG CoA 
reductase enzyme) provides higher HMG CoA reductase 
activity and thus increased delta-5 steroid compound 
production over the same case with an introduced full- 
length HMG CoA reductase gene. A useful truncated HMG 
CoA reductase nucleic acid encodes at least the 
catalytic domain. 
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Hydroxymethylglutaryl-CoA reductase is enzyme 
number 1.1.1.88, using the recommended nomenclature of 
the International Union of Biochemistry and Molecular 
Biology on the Nomenclature and Classification of 
Enzymes, Enzyme Nomenclature 1992 . Edwin C. Webb, ed . , 
Academic Press, Inc. (San Diego, California: 1992), 
page 35. 

The present invention contemplates transforming a 
plant cell with a structural gene that encodes a 
polypeptide having HMG-CoA reductase activity. The 
HMG-CoA reductase enzymes of both animal and yeast 
cells comprise three distinct amino acid residue 
sequence regions, which regions are designated the 
catalytic region, the membrane -binding region and the 
linker region. 

The catalytic region contains the active site of 
the HMG-CoA reductase enzyme and comprises about forty 
percent of the COOH- terminal portion of intact HMG-CoA 

reductase enzyme . 

The membrane -binding region contains hydrophobic 
amino acid residues and comprises about fifty percent 
of the NH2- terminal portion of intact HMG-CoA reductase 

enzyme . 

The linker region connects the catalytic and 
membrane -binding regions, and constitutes the remaining 
about ten percent of the intact enzyme. 

As discussed in greater detail below, only the 
catalytic region of HMG-CoA reductase is needed herein 
to provide the desired enzyme activity. Thus, an 
exogenous structural gene that encodes a polypeptide 
corresponding to that catalytic region is the minimal 
HMG Co A reductase gene required for transforming plant 
cells in addition to one of the steroid pathway enzymes 
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discussed below. The present invention therefore 
contemplates use of both intact and truncated 
structural genes that encode a polypeptide having HMG- 
CoA reductase activity. 

A structural gene encoding a polypeptide having 
HMG-CoA reductase activity can be obtained or 
constructed from a variety of sources and by a variety 
of methodologies. See, e.g. Carlson et al . , Cell, 
28:145 (1982); Rine et al . , Proc . Natl. Acad. Sci . USA, 
80:6750 (1983). Exemplary of such structural genes are 
the mammalian and yeast genes encoding HMG-CoA 
reductase or the catalytic region thereof. 

The disclosures of Chappell, et al . , U.S. Patent 
No. 5,349,126, are incorporated in full herein by 
reference. The mammalian genome contains a single gene 
encoding HMG-CoA reductase. The nucleotide sequence of 
the hamster and human gene for HMG-CoA reductase have 
been described in Chappell et al . A composite 
nucleotide sequence of DNA corresponds to the mRNA SEQ 
ID N0:1 of Chappell et al . , as well as the derived 
amino acid residue sequence SEQ ID NO : 2 of Chappell et 
al . , for hamster HMG-CoA reductase is provided in Fig. 
2 of Chappell et al , reprinted from Chin et al . , 
Nature, 308:613 (1984). The composite nucleotide 
sequence of Fig. 2, SEQ ID NO : 1 of Chappell et al . , 
comprising about 4768 base pairs, includes the 
nucleotide sequence encoding the intact hamster HMG-CoA 
reductase enzyme . 

Intact hamster HMG-CoA reductase comprises about 
887 amino acid residues (SEQ ID NO: 2 of Chappell et 
al . ) . A structural gene encoding an intact hamster 
HMG-CoA reductase enzyme of 887 amino acid residues 
comprises base pairs from about nucleotide position 164 
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to about nucleotide position 2824 of SEQ ID NO : 1 of 
Chappell et al . 

A preferred structural gene is one that encodes a 
polypeptide corresponding to only the catalytic region 
of the enzyme. Two catalytically active segments of 
hamster HMG-CoA reductase have been defined. Liscum et 
al., cJ. Biol, Chem,, 260 (1) :522 (1985). One segment 
containing a catalytic region has an apparent molecular 
weight of 62 kDa and comprises amino acid residues from 
about position 373 to about position 887. A second 
segment containing a catalytic region has an apparent 
molecular weight of 53 kDa segment and comprises amino 
acid residues from about position 460 to about position 
887. The 62 kDa catalytically active segment is 
encoded by base pairs from about nucleotide position 
1280 to about nucleotide position 2824 of SEQ ID NO:l 
of Chappell et al . The 53 kDa catalytically active 
segment is encoded by base pairs from about nucleotide 
position 1541 to about nucleotide position 2824 of SEQ 
ID NO:l of Chappell et al . 

In a preferred embodiment, the utilized structural 
gene encodes the catalytic region and at least a 
portion of the linker region of HMG-CoA reductase. The 
linker region of hamster HMG-CoA reductase comprises 
amino acid residues from about position 34 0 to about 
position 373 or from about position 340 to about 
position 460, depending upon how the catalytic region 
is defined. These linker regions are encoded by base 
pairs from about nucleotide position 1180 to about 
nucleotide position 1283 or from about position 1180 to 
about position 1540, respectively of SEQ ID NO:l of 
Chappell et al . The structural gene encoding the 
linker region is operatively linked to the structural 
gene encoding the catalytic region. 
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In one particularly preferred embodiment, a 
structural gene encoding a catalytically active, 
truncated HMG-CoA reductase enzyme can optionally 
contain base pairs encoding a small portion of the 
membrane region of the enzyme . 

A structural gene encoding a polypeptide 
comprising a catalytically active, truncated or intact 
HMG-CoA reductase enzyme from other organisms such as 
yeast can also be used in accordance with the present 
invention . 

Yeast cells contain two genes encoding HMG-CoA 
reductase. The two yeast genes, designated HMGl and 
HMG2, encode two distinct forms of HMG-CoA reductase, 
designated HMG-CoA reductase 1 SEQ ID NO : 3 of Chappell 
et al . are presented in Fig. 3 of Chappell et al . , are 
taken from Basson et al . Mol . Cell Biol., 8(9):3797 
(1988) . The nucleotide base sequences of HMG2 SEQ ID 
NO: 5 of Chappell et al . as well as the amino acid 
residue sequence of HMG-CoA reductase 2 SEQ ID NO: 6 of 
Chappell et al . are set forth therein in the Sequence 
Listing . 

The entire HMGl gene comprises about 3360 base 
pairs SEQ ID NO : 3 of Chappell et al . Intact HMG-CoA 
reductase 1 comprises an amino acid sequence of about 
1054 amino acid residues SEQ ID NO: 4 of Chappell et al . 
Thus, the minimal portion of the HMGl gene that encodes 
an intact enzyme comprises base pairs from about 
nucleotide position 121 to about position 3282 of Fig. 
3, SEQ ID NO: 3 of Chappell et al . 

The entire HMG2 gene comprises about 3348 base 
pairs SEQ ID NO: 5 of Chappell et al . Intact HMG-CoA 
reductase 2 comprises about 104 5 amino acid residues 
SEQ ID NO: 6 of Chappell et al . Thus, the minimal 
portion of HMG2 gene that encodes intact HMG-CoA 
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reductase 2 comprises base pairs from about nucleotide 
position 121 to about position 3255 of SEQ ID NO : 5 of 
Chappell et al . 

By analogy to the truncated hamster structural 
gene, structural genes encoding polypeptides comprising 
catalytically active, truncated HMG-CoA reductase 
enzymes from yeast can also be used in accordance with 
the present invention. 

The catalytic region of HMG-CoA reductase 1 
comprises amino acid residues from about residue 618 to 
about reside 1054: i.e., the COOH- terminus . A 
structural gene that encodes the catalytic region 
comprises base pairs from about nucleotide position 
1974 to about position 3282 of Fig. 3 of Chappell et 
al . 

The linker region of HMG-CoA reductase 1 comprises 
an amino acid sequence from about residue 525 to about 
residue 617. A structural gene that encodes the linker 
region comprises nucleotides from about position 1695 
to about position 1973 of Fig. 3 of Chappell et al . A 
structural gene encoding the linker region of the 
enzyme operatively linked to the structural gene 
encoding the catalytic region of the enzyme. 

Also by analogy to the truncated hamster gene, a 
truncated HMGl gene can optionally contain nucleotide 
base pair sequences encoding a small portion of the 
membrane -binding region of the enzyme. Such a 
structural gene preferably comprises base pairs from 
about nucleotide position 121 to about position 147 and 
from about position 1695 to about position 3282 of Fig. 
3 of Chappell et al . 

A construct similar to those above from an 
analogous portion of yeast HMG-CoA reductase 2 can also 
be utilized. 
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A nucleic acid sequence encoding HMG-CoA reductase 
from Hevea brasiliGnsis has been disclosed by Chye et 
al . (1991) Plant Mol . Biol. 16: 567-577. A nucleic 
acid sequence encoding an Arabidopsis thaliana HMG-CoA 
reductase has been published by Caelles et al . (1989) 
Plant Mol. Biol. 13: 627-638, and is also available as 
GenBank accession number L19261. U.S. Patents Nos . 
5,306,862 and 5,365,017 disclose additional DNA 
sequences encoding HMG-CoA reductases. 

The following sequences are listed by Genbank 
Accession numbers: 

02 6 6 62 methanobacterium thermoautotrophicum . 3- 
hydroxy- 3 -methylglutaryl- coenzyme a reductase (ec 
1.1.1-34) (hmg-coa reductase). 12/1998 

Q58116 methanococcus jannaschii. 3 -hydroxy- 3- 
methylglutaryl- coenzyme a reductase (ec d . 1 . 1 . 34 ) (hmg- 
coa reductase) . 7/1998 

Q59468 halobacterium volcanii (haloferax 
volcanii) . 3 -hydroxy- 3 -methylglutaryl -coenzyme a 
reductase (ec 1.1.1.34) (hmg-coa reductase). 7/1998 

008424 sulfolobus solf ataricus . 3-hydroxy-3- 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg- 
coa reductase) . 12/1998 

M22255 Saccharomyces cerevisiae Yeast HMG-CoA 
reductase (HGM2) gene, complete cds; 3 -hydroxy- 3 -methyl 
glutaryl coenzyme A reductase. 4/1993 

M22002 Saccharomyces cerevisiae Yeast HMG-CoA 
reductase (HGMl) gene, complete cds; 3 -hydroxy- 3- 
methyl -glutaryl coenzyme A reductase. 4/1993 

Q12 64 9 phycomyces blakesleeanus . 3 -hydroxy- 3- 
methylglutaryl- coenzyme a reductase (ec 1.1,1.34) (hmg- 
coa reductase) (fragment) . 11/1997 
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Q12577 fusarium monilif orme (gibberella 
fujikuroi) . 3 -hydroxy- 3 -methylglutaryl- coenzyme a 
reductase (ec 1.1.1.34) (hmg- coa reductase). 11/1997 

AB012603 Candida utilis Candida utilis HMG mRNA 
for HMG-CoA reductase, complete cds . 7/1998 

P34136 dictyostelium discoideum (slime mold) . 3- 
hydroxy-3 -methylglutaryl -coenzyme a reductase 2 (ec 
1.1.1.34) (hmg- coa reductase 2) (f ragment) . 35735 

PQ0761 hydroxymethylglutaryl-CoA reductase (NADPH) 
(EC 1.1.1.34) (HMGR 10) - wheat (fragment) 

P48019 oryza sativa (rice) . 3-hydroxy-3- 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg- 
coa reductase) (fragment) . 2/1996 

024594 zea mays (maize) . 3-hydroxy-3- 
methylglutaryl coenzyme a reductase (ec 1.1.1.88). 
5/1999 

PQ0763 hydroxymethylglutaryl-CoA reductase (NADPH) 
(EC 1.1.1,34) (HMGR 23) - wheat (fragment) 

PQ0 7 62 hydroxymethylglutaryl-CoA reductase (NADPH) 
(EC 1.1.1.34) (HMGR 18) - wheat (fragment) 

from proprietary soy sequence database 

Q00583 hevea brasiliensis (para rubber tree) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 3 (ec 
1.1.1.34) (hmg- coa reductase 3). 7/1998 

Q03163 catharanthus roseus (rosy periwinkle) 
(madagascar periwinkle) . 3 -hydroxy- 3 -methylglutaryl - 
coenzyme a reductase (ec 1.1.1.34) (hmg-coa reductase). 
7/1998 

P48022 lycopersicon esculentum (tomato) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 2 (ec 
1.1.1.34) (hmg- coa reductase 2). 7/1998 

Q01559 nicotiana sylvestris (wood tobacco) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase (ec 
1.1.1.34) (hmg-coa reductase). 7/1998 
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L014 0 0 Solanum tuberosum Potato 
hydroxymethylglutaryl coenzyme A reductase (hmgr) mRNA, 
complete cds; putative. 4/1996 

Q43826 raphanus sativus (radish) . 
hydroxymethylglutaryl -coa reductase (ec 1.1,1.34) 
(hydroxymethylglutaryl -coa reductase (nadph) ) (3- 
hydroxy- 3 -methylglutaryl- coenzyme a red 

L19261 Arabidopsis thaliana Arabidopsis thaliana 
HMG-cOA reductase gene, complete cds. 4/1994 

AB021862 Cucumis melo Cucumis melo mRNA for HMG- 
CoA reductase, complete cds; putative. 1/1999 

P29058 hevea brasiliensis (para rubber tree) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 2 (ec 
1.1.1.34) (hmg- coa reductase 2) (f ragment) . 35735 

P29057 hevea brasiliensis (para rubber tree) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 1 (ec 
1.1.1.34) (hmg- coa reductase 1). 7/1998 

P4 8 021 camptotheca acuminata. 3 -hydroxy- 3- 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg- 
coa reductase) . 11/1997 

P43256 arabidopsis thaliana (mouse-ear cress) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 2 (ec 
1.1,1.34) (hmg- coa reductase 2) (hmgr2) . 7/1998 

P00347 cricetulus griseus (Chinese hamster) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme A reductase (ec 
1,1.1.34) (hmg-coA reductase). 11/1997 

L00183 Cricetulus sp . Hamster 3-hydroxy-3- 
methylglutaryl coenzyme A (HMG CoA) reductase gene, 
exons 19 and 20; 3 -hydroxy- 3 -methylglutaryl coenzyme A 
(HMG CoA) . 4/1993 

M12705 Mesocricetus auratus Syrian hamster 3- 
hydroxy-3-methylglutaral coenzyme A reductase (HMG-CoA 
reductase) mRNA, complete cds; 3-hydroxy-3- 
methylglutaral coenzyme A red 
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P5163 9 rattus norvegicus (rat) . 3-hydroxy-3- 
methylglutaryl -coenzyme a reductase (ec 1,1.1.34) (hmg- 
coa reductase) . 12/1998 

Q2 9512 oryctolagus cuniculus (rabbit) . 3-hydroxy- 
3 -me thylglutaryl -coenzyme a reductase (ec 1.1.1.34) 
(hmg-coa reductase) . 7/1999 

Ml 10 58 Homo sapiens Human 3 -hydroxy- 3- 
methylglutaryl coenzyme A reductase mRNA, complete cds; 
3-hydroxy-3-methylglutaryl coenzyme A reductase . 11/1994 

M62766 Mus musculus Mouse HMG-CoA reductase mRNA, 
3* end. 4/1993 

P20715 xenopus laevis (african clawed frog) . 3- 
hydroxy- 3 -methylglutaryl- coenzyme a reductase (ec 
1.1,1.34) (hmg-coa reductase). 11/1997 

P163 93 strongylocentrotus purpuratus (purple sea 
urchin) . 3 -hydroxy-3 -methylglutaryl -coenzyme a 
reductase (ec 1.1.1.34) (hmg-coa reductase). 11/1997 

P54 960 blattella germanica (german cockroach) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase (ec 
1.1,1.34) (hmg-coa reductase). 11/1997 

P14773 drosophila melanogaster (fruit fly) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase (ec 
1.1.1.34) (hmg-coa reductase), 12/1998 

P34135 dictyostelium discoideum (slime mold) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 1 (ec 
1.1.1.34) (hmg- coa reductase 1). 11/1997 

P16237 schistosoma mansoni (blood fluke) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase (ec 
1.1.1.34) (hmg-coa reductase). 7/1998 

02 853 8 archaeoglobus f ulgidus . 3 -hydroxy- 3 - 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg- 
coa reductase) . 12/1998 
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M24 015 Pseudomonas mevalonii P.mevalonii HMG-CoA 
reductase (mvaA) gene, complete cds; HMG-CoA reductase 
(EC 1.1.1. 88) . 4/1993 



5 B . Steroid Pathway Enzymes 

The present invention contemplates nucleic acid 
sequences encoding polypeptides having the enzyme 
activity of the steroid pathway enzymes squalene 
epoxidase, sterol methyl transferase I, sterol C4- 
10 demethylase, obtusif oliol C14a-demethylase , sterol C5- 
desaturase and sterol methyl transferase II. 



i . Squalene Epoxidase 

Squalene epoxidase (also called squalene 

15 monooxygenase) catalyzes the .conversion of squalene to 
squalene epoxide (2 , 3 -oxidosqualene) , a precursor to 
the initial sterol molecule in phytosterol biosynthetic 
pathway, cycloartenol . This is the first reported step 
in the pathway where oxygen is required for activity. 

20 The formation of squalene epoxide is also the last 

common reported step in sterol biosynthesis of animals, 
fungi and plants. Recently, several homologues of 
Arahidopsis and Brassica squalene epoxidase genes were 
reported (Schafer, U.A. , Reed, D.W., Hunter, D.G., Yao, 

25 K. , Weninger, A.M., Tsang, E.W., Reaney, M.J., 

MacKenzie, S.L., and Covello, P.S. (1999). Plant Mol . 
Biol. 39(4): 721-728). The same authors also have a PCT^^ 
application disclosing the use of antisense technology 
with squalene epoxidase to elevate squalene levels in 

30 plants (WO 97/34003) . However, to date there are no 
reports on functional characterization of any plant 
squalene epoxidase gene or enzyme. 
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Squalene Epoxidase, also known as squalene 
monooxygenase is enzyme reference number 1.14.99.7, 
Enzyme Nomenclature 1992 , p. 146. 

Several squalene epoxidase enzymes are known to 
the art. These include Arahidopsis squalene epoxidase 
protein sequence Accession No. AC004786 (SEQ ID N0:1), 
Arahidopsis squalene epoxidase Accession No. N64916 
(SEQ ID N0:2), and Arabidopsis squalene epoxidase 
Accession No. T44667 (SEQ ID NO : 3 ) . Japanese patent 
application No. 07194381 A discloses a DNA encoding a 
mammalian squalene epoxidase. 

In order to facilitate the modifications to sterol 
biosynthesis and accumulation described herein, the 
present invention also provides an isolated DNA 
molecule, comprising a nucleotide sequence selected 
from the group consisting of: 

(a) Arabidopsis squalene epoxidase from clone ID 
ATA506263 disclosure SEQ ID NO : 4 , clone ID ATA304243 
disclosure SEQ ID NO: 6, clone ID ATA102071 disclosure 
SEQ ID NO: 8, clone ATA504158 disclosure SEQ ID NO: 10, 
or the complement thereof; 

(b) a nucleotide sequence that hybridizes to said 
nucleotide sequence of (a) under a wash stringency 
equivalent to 0 . 5X SSC to 2X SSC, 0.1% SDS, at 55-65°C, 
and which encodes a polypeptide having squalene 
epoxidase enzymatic activity substantially similar to 
that of the disclosed squalene epoxidase; 

(c) a nucleotide sequence encoding the same 
genetic information as said nucleotide sequence of (a) , 
but which is degenerate in accordance with the 
degeneracy of the genetic code; and 

(d) a nucleotide sequence encoding the same 
genetic information as said nucleotide sequence of (b) , 
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but which is degenerate in accordance with the 
degeneracy of the genetic code. 

An additional aspect of the invention is the 
recombinant constructs and vectors (pMON4 8343, Fig. 30; 
5 pMON43844, Fig. 31) comprising nucleic acid sequences 
encoding the novel squalene epoxidase, as well as a 
method of producing the novel squalene epoxidase, 
comprising culturing a host cell transformed with the 
novel constructs or vectors for a time and under 
10 conditions conductive to the production of the squalene 
epoxidase, and recovering the squalene epoxidase 
produced thereby. 

ii , Sterol Methyl Transferase 1 

15 S-adenosyl-L-methionine : sterol C-24 methyl 

transferases (SMTl and SMT2) catalyze the transfer of a 
methyl group from a cofactor, S-adenosyl-L-methionine, 
to the C-24 center of the sterol side chain (Bach, T.J. 
and Benveniste, P. (1997), Prog. Lipid Res. 36: 197- 

20 226) . SMT in higher plant cells are responsible for 

their capability to produce a mixture of 24 -methyl and 
24-ethyl sterols (Schaffer, A., Bouvier-Nave, 
Benveniste, P., Schaller, H. (2000) Lipids 35: 263- 
269) . Functional characterization of the SMT using a 

25 yeast ergS expression system demonstrated unambiguously 
that an SMTl sequence encodes a cycloartenol-C24 - 
methyltransf erase and a SMT2 sequence encodes a 24- 
methylene lophenol -C24 -methyltransf erase in a given 
plant species (Bouvier-Nave, P., Husselstein, T. , and 

30 Benveniste, P. (1998), Eur. J. Biochem. 246: 518-529). 
Several plant genes coding for SMTl and SMT2 have been 
reported and reviewed (Schaffer, A., Bouvier-Nave, 
Benveniste, P., Schaller, H. (2000) Lipids 35: 263- 
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269) . Transgenic plants expressing homologues of either 
SMTl or SMT2 have been studied (Schaf fer, A. , Bouvier- 
Nave, Benveniste, P., Schaller, H. (2000) Lipids 35: 
263-269) . The use of these genes to modify plant sterol 
composition are also covered by two Monsanto patent 
applications (WO 98/45457 and WO 00/61771) . 

Sterol methyl transferase I enzymes known in the 
art are useful in the present invention. Examplary 
sequences include the known Arahidopsis sterol methyl 
transferase I protein sequence Accession No. U71400 
(disclosure SEQ ID NO: 19), the known tobacco sterol 
methyl transferase I protein sequence Accession No. 
U81312 (disclosure SEQ ID NO: 20) and Ricinus communis 
sterol-C-methyltransf erase, Eur. J". Biochem. , 246(2), 
518-529 (1997) . (Complete cds. Accession No. g2246457) . 

S-Adenosyl -L-Methionine - Sterol -C24 - 
MethyltransferasG- -A nucleic acid sequence encoding an 
Arabidopsis thaliana. S-adenosyl -L-methionine-sterol - 
C24-methyltransf erase has been published by Husselstein 
et al . (1996) FEES Letters 381: 87-92. A^^-sterol C- 
methyltransf erase is enzyme number 2.1.1.41, Enzyme 
Nomenclature 19 92 , page 160. 

iii . Sterol C4 -Demethvlase 

Sterol C-4 demethylase catalyses the first of 
several demethylation reactions, which results in the 
removal of the two methyl groups at C-4 . While in 
animals and fungi the removal of the two C-4 methyl 
groups occurs consecutively, in plants it has been 
reported that there are other steps between the first 
and second C-4 demethylations (Bach, T.J. and 
Benveniste, P. (1997), Prog. Lipid Res. 36: 197-226). 
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The C-4 demethylation is catalyzed by a complex of 
microsomal enzymes consisting of a monooxygenase , an 
NAD^- dependent sterol 4 -decarboxylase and an NADPH- 
dependent 3 -ketosteroid reductase. 

iv . Obtusif oliol C14a"Demethylase 

Sterol C-14 demethylase catalyzes demethylation at 
C-14 which removes the methyl group at C-14 and creates 
a double bond at that position. In both fungi and 
animals, this is the first step in the sterol synthesis 
pathway. However, in higher plants, the 14a-methyl is 
removed after one C-4 methyl has disappeared. Thus, 
while lanosterol is the substrate for C-14 demethylase 
in animal and fungal cells, the plants enzyme uses 
obtusif oliol as substrate. Sterol 14 -demethylation is 
mediated by a cytochrome P-450 complex. The mechanism 
of 14a-methyl removal involves two oxidation steps 
leading to an alcohol, then an aldehyde at C-2 9 and a 
further oxidative step involving a def ormylation 
leading to formic acid and the sterol product with a 
typical 8,14-diene (Aoyama, Y., Yoshida, Y. , Sonoda, 
Y., and Sato, Y. (1989) J. Biol. Chem. 264: 18502- 
18505) . Obtusifoliol 14a-demethylase from Sorghum 
bicolor (L) Moench has been cloned using a gene- 
specific probe generated using PCR primers designed 
from an internal 14 amino acid sequence and was 
functionally expressed in E. coli (Bak, S, Kahn, R.A. , 
Olsen, C.E. and Halkier, B.A. (1997) The Plant Journal 
11(2): 191-201). Also, Saccharomyces cerevisiae CYP51A1 
encoding lanosterol - 14 -demethylase was functionally 
expressed in tobacco (Grausem, B., Chaubet, N. , Gigot, 
C, Loper, J.C., and Benveniste, P. (1995) The Plant 
Journal 7(5): 761-770). 



\ 



77 MTC 6783.1 

PATENT 

Sterol C-14 demethylase enzymes and sequences are 
known in the art. For example Sorghum bicolor 
obtusifoliol 14a-demethylase CYP51 mRNA, described in 
Plant J"., 11(2):191-201 (1997) (complete cds Acession 
5 No. U74319) . In order to facilitate the modifications 
to sterol biosynthesis and accumulation described 
herein, the present invention also provides an isolated 
DNA molecule, having a nucleotide sequence selected 
from the group consisting of: 
10 (a) obtusifoliol C14a-demethylase from clone ID: 

ATA101105 disclosure SEQ ID NO: 14, clone ID ATA202967 
disclosure SEQ ID NO: 15, clone ID ATA403931 disclosure 
SEQ ID NO: 17, or the complement thereof; 

(b) a nucleotide sequence that hybridizes to said 
15 nucleotide sequence of (a) under a wash stringency 

equivalent to 0 . 5X SSC to 2X SSC, 0.1% SDS , at 55-65°C, 
and which encodes a polypeptide having obtusifoliol 
C14a-demethylase enzymatic activity substantially 
similar to that of the disclosed obtusifoliol C14a- 
20 demethylase; 

(c) a nucleotide sequence encoding the same 
genetic information as said nucleotide sequence of (a) , 
but which is degenerate in accordance with the 
degeneracy of the genetic code; and 

25 (d) a nucleotide sequence encoding the same 

genetic information as said nucleotide sequence of (b) , 
but which is degenerate in accordance with the 
degeneracy of the genetic code. 

An additional aspect of the invention is the 

30 recombinant constructs and vectors (pMON43842, Fig. 29) 
comprising nucleic acid sequences encoding* the novel 
obtusifoliol C14a-demethylase , as well as a method of 
producing the novel obtusifoliol C14a-demethylase , 
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comprising culturing a host cell transformed with the 
novel constructs or vectors for a time and under 
conditions conductive to the production of the 
obtusifoliol C14a-demethylase, and recovering the 
5 obtusifoliol C14a-demethylase produced thereby. 




V. Sterol C5 -Desaturase 

Sterol C-5 desaturase catalyzes the insertion of 
the A^-double bond that normally occurs at the A''- sterol 
10 level, thereby forming a A^'''-sterol (Parks et al . , 
Lipids 30:227-230 (1995)). The reaction has been 
reported to involve the stereospecif ic removal of the 
5a and 6a hydrogen atoms, biosynthetically derived from 
the 4 pro-R and 5 pro-S hydrogens of the (+) and (-) R- 
15 mevalonic acid, respectively (Goodwin, T.W. (1979) 
^ Annu. Rev. Plant Physiol. 30: 369-404). The reaction 

Q is obligatorily aerobic and requires NADPH or NADH. 

fi\ The desaturase has been reported to be a multienzyme 

^ complex present in microsomes. It consists of the 

20 desaturase itself, cytochrome and a pyridine 

nucleotide-dependent f lavoprotein . The A^-desaturase is 
reported to be a mono -oxygenase that utilizes electrons 
derived from a reduced pyridine nucleotide via 
cytochrome^ (Taton, M. , and Rahier, A. (1996) Arch. 
25 Biochem. Biophys . 325: 279-288). An Arabidopsis 

thaliana cDNA encoding a sterol -C5 -desaturase was 
cloned by functional complementation of a yeast mutant, 
erg3 defective in ERGS, the gene encoding the sterol 
C5-desaturase required for ergosterol biosynthesis 
30 (Gachotte D., Husselstein, T. , Bard, M. , Lacroute F. , 
and Benveniste, P. (1996) The Plant Journal 9(3): 391- 
3 98) . Known sterol C5 -desaturase enzymes are useful in 
the present invention, including Arabidopsis sterol C5- 
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desaturase protein sequence Accession No. X90454, 
disclosure SEQ ID NO: 22, and the Arahidopsis thaliana 
mRNA for sterol -C5 -desaturase described in Plant J, 
9(3):391-398 (1996) (complete cds Accession No. 
5 gl061037) . 

The NCBI* (National Center for Biotechnology 
Information) database shows 37 sequences for sterol 
desaturase that are useful in the present invention. 
The following are exemplary of such sequences. From 

10 yeast: C5 sterol desaturase NP_013157 (Saccharomyces 

cerevisiae) ; hypothetical C5 sterol desaturase-f ission 
T40027 (Schizosaccharomyces pombe) ; C5 sterol 
desaturase-f ission T37759 (Schizosaccharomyces pombe) ; 
C5 sterol desaturase JQ1146 (Saccharomyces cerevisiae) ; 

15 C5 sterol desaturase BAA21457 (schizosaccharomyces 
pombe) ; C5 sterol desaturase CAA22610 
(Schizosaccharomyces pombe) ; putative C5 sterol 
desaturase CAA16898 (Schizosaccharomyces pombe) ; 
probable C5 sterol desaturase 013666 (erg3 _schpo) ; C5 

20 sterol desaturase P50860 (Erg3_canga) ; C5 sterol 
desaturase P32353 (erg3_yeast) ; C5,6 desaturase 
AAC99343 (Candida albicans); C5 sterol desaturase 
BAA20292 (Saccharomyces cerevisiae) ; C5 sterol 
desaturase AAB39844 (Saccharomyces cerevisiae) ; C5 

25 sterol desaturase AAB29844 (Saccharomyces cerevisiae) ; 
C5 sterol desaturase CAA64303 (Saccharomyces 
cerevisiae) ; C5 sterol desaturase AAA34595 
(Saccharomyces cerevisiae) ; C5 sterol desaturase 
AAA34594 (Saccharomyces cerevisiae) . From plants: C5 

30 sterol desaturase S71251 (Arabidopsis thaliana) ; 

putative sterol -C5-desaturase AAF32466 (Arabidopsis 
thaliana) ; sterol-C5-desaturase AAF32465 (Arabidopsis 
thaliana) ; putatuve sterol desaturase AAF22 921 
(Arabidopsis thaliana) ; delta7 sterol C5 desaturase 



80 MTC 6783.1 

PATENT 

(Arabidopsis thaliana) ; sterol C5(6) desaturase homolog 
AAD20458 (Nicotiana tabacum) ; sterol C5 desaturase 
AAD12944 (Arabidopsis thaliana); sterol C5 , 6 desaturase 
AAD04034 (Nicotiana tabacum) ; sterol C5 desaturase 
CAA62079 (Arabidopsis thaliana). From mammals: sterol- 
C5-desaturase (Mus musculus) BAA33730; sterol-C5- 
desaturase BAA3 3 72 9 (Homo sapiens) ; lathosterol oxidase 
CAB65928 (Leishmania major) ; lathosterol oxidase 

(lathosterol 5-desaturase) 088822 (Mus musculus) ; 
lathosterol 5-desaturase 075845 (Homo sapiens) ; delta7 
sterol C5 desaturase AAF00544 (Homo sapiens). Others: 
fungal sterol C5 desaturase homolog BAA18 97 0 (Homo 
sapiens) . 

* 

For DNA sequences encoding a sterol -C5-desaturase 
useful in the present invention, the NCBI_nucleotide 
search for "sterol desaturase" came up with 110 
sequences. The following are exemplary of such 
sequences. NC_001139 (Saccharomyces cerevisiae) ; 
NC_001145 (Saccharomyces cerevisiae) ; NC_001144 
(Saccharomyces cerevisiae) ; AW700015 (Physcomitrella 
patens) ; ABO 04 53 9 (Schizosaccharomyces pombe) ; and 
AW596303 (Glycine max) ; AC012188 (Arabidopsis 
thaliana) . 

vi . Sterol Methyl Transferase II 
The combination of introduction of an HMG-CoA 
reductase gene along with a sterol methyl transferase 
II gene into a cell serves to reduce steroid pathway 
intermediate compound accumulation in addition to 
reducing the accumulation of 24 -methyl sterols such as 
campesterol . 

Known sterol methyl transferase II enzymes are 
useful in the present invention, including Arahidopsis 
sterol methyl transferase II protein sequence (complete 
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mRNA cds from FEES Lett. 381 (12) : 87-92 (1996) Accession 
No. X89867) , disclosure SEQ ID NO:21. 

Recombinant constructs encoding any of the 
forgoing enzymes affecting the steroid biosynthetic 
pathway can be incorporated into recombinant vectors 
comprising the recombinant constructs comprising the 
isolated DNA molecules. Such vectors can be bacterial 
or plant expression vectors. 

IV. Recombinant Constructs and Vectors 
The present invention contemplates a recombinant 
construct that contains a DNA sequence encoding a 
polypeptide exhibiting 3 -hydroxy- 3 -methylfluaryl- 
Coenzyme A (HMG-CoA) reductase activity and a DNA 
sequence encoding a polypeptide exhibiting the activity 
of a steroid pathway enzyme. Each polypeptide -encoding 
DNA sequence is operably linked in the 5' to 3 ^ 
direction independent of the other sequence. Each DNA 
sequence in the 5' to 3 ' direction comprises a 
promoter, then the DNA sequence encoding the 
polypeptide then a transcription termination signal 
sequence. The steroid pathway enzyme is a squalene 
epoxidase enzyme, a sterol methyl transferase I enzyme, 
a sterol C4 -demethylase enzyme, a obtusifoliol C14a- 
demethylase enzyme, a sterol C5 -desaturase enzyme, or a 
sterol methyl transferase II enzyme. 

Preferably, the promoters in the recombinant 
construct are seed-specific promoters. In one 
embodiment, the promoter is derived from a species in a 
different order from the host cell. In other 
embodiments, the encoded HMG CoA reductase and/or 
steroid pathway enzymes is (are) from a species in a 
different order from the order that of the host cell. 
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It is contemplated that a construct comprises more 
than one of the DNA sequences encoding a steroid 
pathway enzyme . 

The invention also contemplates a recombinant 
vector comprising the above -described recombinant 
construct, wherein that vector is preferably a plant 
expression vector. 

A recombinant DNA molecule of the present 
invention can be produced by operatively linking a 
vector to a useful DNA segment discussed herein to form 
a plasmid. A vector capable of directing the 
expression of a polypeptide having HMG-CoA reductase 
activity is referred to herein as an HMG-CoA reductase 
"plant integrating vector" . 

Such plant integrating vectors contain control 
elements that direct and regulate expression, including 
a promoter, a marker, a terminator and insertion 
sequence (e.g. FIG. 5) . The polypeptide coding genes 
are operatively linked to the plant integrating vector 
to allow the promoter sequence to direct RNA polymerase 
binding and expression of the desired polypeptide 
coding gene . 

Useful in expressing the polypeptide coding gene 
are promoters that are inducible, viral, synthetic, 
constitutive as described by Poszkowski et al . , EMBO 
J., 3:2719 (1989) and Odell et al . , Nature, 313:810 
(1985) , and temporally regulated, spatially regulated 
and spatiotemporally regulated as given in Chau et al . , 
Science, 244:174-181 (1989). The promoter preferably 
comprises a promoter sequence whose function in 
regulating expression of the structural gene is 
substantially unaffected by the amount of sterol or 
squalene in the cell. As used herein, the term 
"substantially unaffected" means that the. promoter is 
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not responsive to direct feedback control by the 
sterols or squalene accumulated in transformed cells or 
transgenic plants . 

A promoter is also selected for its ability to 
direct the transformed plant cell's or transgenic 
plant's transcriptional activity to the structural gene 
encoding a polypeptide having HMG-CoA reductase 
activity. Structural genes can be driven by a variety 
of promoters in plant tissues. Promoters can be near- 
constitutive, such as the CaMV 35S promoter, or tissue 
specific or developmentally specific promoters 
affecting dicots or monocots. 

As exemplified and discussed in detail herein, 
where the near-constitutive promoter CaMV 35S is used 
to transf-orm tobacco plants, increases in total sterol 
and squalene accumulation are found in a variety of 
transformed plant tissues (e.g. callus, leaf, seed and 
root) . Alternatively, the effects of transformation 
(e.g. increased amount of a gene coding for HMG-CoA 
reductase, increased total sterol accumulation and 
increased squalene accumulation) can be directed to 
specific plant tissues by using plant integrating 
vectors containing a tissue-specific promoter. 

An exemplary tissue-specific promoter is the 
Lectin promoter, which is specific for seed tissue. 
The Lectin protein in soybean seeds is encoded by a 
single gene (Lei) that is only expressed during seed 
maturation and accounts for about 2 to about 5 percent 
of total seed mRNA. The Lectin gene and seed-specific 
promoter have been fully characterized and used to 
direct seed specific expression in transgenic tobacco 
plants. See, e.g., Vodkin et al . , Cell, 34:1023 (1983) 
and Lindstrom et al , , Developmental Genetics, 11:160 
(1990) . 
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A plant integrating vector containing a structural 
gene coding for a polypeptide having HMG-CoA reductase 
activity is engineered to be under control of the 
Lectin promoter and that vector is introduced into 
soybean plants using a protoplast transformation 
method. E.G. Dhir et al . , Plant Cell Reports, 10:97 
(1991) . The expression of the polypeptide having HMG- 
CoA reductase activity is directed specifically to the 
seeds of the transgenic plant. In this way, a 
transgenic soybean seed having increased squalene 
accumulation is produced. Such seeds can then be used 
to extract oil containing enhanced levels of squalene. 
As set forth hereinafter, such squalene-enhanced oil is 
characterized by a greater thermal stability when 
compared to non- squalene -enhanced oil. 

In the present invention, a plant has an 
exogenously provided structural gene for HMG-CoA 
reductase and at least one of the six enumerated 
steroid pathway enzymes, a squalene epoxidase enzyme, a 
sterol methyl transferase I enzyme, a sterol C4- 
demethylase enzyme, a obtusifoliol C14a-demethylase 
enzyme, a sterol C5-desaturase enzyme, or a sterol 
methyl transferase II enzyme. The plant or seed thus 
containing these added genes is contemplated, while the 
methods to arrive at a plant or seed according to the 
invention are open to the multitude of methods 
contemplated by a person of ordinary skill in the art. 
In particular, all of the added structural genes do not 
have to have been added at the same time, or by the 
same route. Thus, for example, the HMG-CoA reductase 
activity may result from a cross with a plant made 
according to a process of U.S. Patent No. 5,349,126, 
while a steroid pathway enzyme is added by nucleic acid 
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bombardment to that plant. Further, when more than one 
nucleotide sequence encoding a steroid pathway enzyme 
is present in a contemplated plant, the expression of 
the gene does not have to be under the control of the 
same promoter, or even the same type of promoter. 

The choice of which plant integrating vector and 
ultimately to which promoter a polypeptide coding gene 
is operatively linked depends directly on the 
functional properties desired, e.g. the location and 
timing of protein expression, and the host cell to be 
transformed. These are well known limitations inherent 
in the art of constructing recombinant DNA molecules. 
However, a vector useful in practicing the present 
invention is capable of directing the expression of the 
polypeptide coding gene, i.e., the gene encoding HMG- 
CoA reductase activity, included in the DNA segment to 
which it is operatively linked. 

Typical vectors useful for expression of genes in 
higher plants are well known in the art and include 
vectors derived from the tumor- inducing (Ti) plasmid of 
AgrobactGriuin tumefaciens, described by Rogers et al . , 
Meth. Enzymol., 153:253-277 (1987). However, several 
other plant integrating vector systems are known to 
function in plants including pCaMVCN transfer control 
vector described by Fromm et al . Proc. Nat, Acad. Sci. 
USA, 82:5824 (1985). Plasmid pCaMVCN (available from 
Pharmacia, Piscataway, N.J.) includes the cauliflower 
mosaic virus CaMV 3 5S promoter. 

The use of retroviral plant integrating vectors to 
form the recombinant DNAs of the present invention is 
also contemplated. As used herein, the term 
"retroviral plant integrating vector" refers to a DNA 
molecule that includes a promoter sequence derived from 
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the long terminal repeat (LTR) region of a retrovirus 
genome . 

In preferred embodiments, the vector used to 
express the polypeptide coding gene includes a 
selection marker that is effective in a plant cell, 
preferably a drug resistance selection marker. One 
preferred drug resistance marker is the gene whose 
expression results in kanamycin resistance; i.e., the 
chimeric gene containing the nopal ine synthase 
promoter, Tn5 neomycin phosphotransferase II and 
nopaline synthase 3' nontranslated region described by 
Rogers et al . , in Methods for Plant Molecular Biology, 
A. Weissbach and H. Weissbach eds . , Academic Press 
Inc., San Diego, Calif. (1988). 

A variety of methods have been developed to 
operatively link DNA to vectors via complementary 
cohesive termini or blunt ends. For instance, 
complementary homopolymer tracts can be added to the 
DNA segment to be inserted and to the vector DNA. The 
vector and DNA segment are then joined by hydrogen 
bonding between the complementary homopolymer ic tails 
to form recombinant DNA molecules . 

Alternatively, synthetic linkers containing one or 
more restriction endonuclease sites can be used to join 
the DNA segment to the plant integrating vector. The 
synthetic linkers are attached to blunt -ended DNA 
segments by incubating the blunt -ended DNA segments 
with a large excess of synthetic linker molecules in 
the presence of an enzyme that is able to catalyze the 
ligation of blunt-ended DNA molecules, such as 
bacteriophage T4 DNA ligase. Thus, the products of the 
reaction are DNA segments carrying synthetic linker 
sequences at their ends. These DNA segments are then 
cleaved with the appropriate restriction endonuclease 
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and ligated into a plant integrating vector that has 
been cleaved with an enzyme that produces termini 
compatible with those of the synthetic linker. 
Synthetic linkers containing a variety of restriction 
endonuclease sites are commercially available from a 
number of sources including New England BioLabs, 
Beverly, Mass. 

Also contemplated by the present invention are RNA 
equivalents of the above -described recombinant DNA 
molecules . 

A preferred recombinant DNA molecules utilized in 
accordance with the present invention are piyiON53 73 3- 
pMON53 74 0 (Figures 13-2 0) . 

A. Promoters and Target Tissues 

Promoters useful in the present invention include 
those that confer appropriate cellular and temporal 
specificity of expression. Such promoters include 
those that are constitutive or inducible, 
environmentally- or developmentally-regulated, or 
organelle-, cell-, or tissue-specific. Preferred 
promoters for use with the present invention promote 
expression of the introduced enzymes in the seed in the 
cytosol, although expression in plasids or organelles 
of the seeds is also contemplated. 

Often-used constitutive promoters include the CaMV 
35S promoter (Odell et al . (1985) Nature 313: 810), the 
enhanced CaMV 3 5S promoter, the Figwort Mosaic Virus 
(FMV) promoter (Richins et al . (1987) NAR 20: 8451), 
the mannopine synthase (mas) promoter, the nopal ine 
synthase (nos) promoter, and the octopine synthase 
(ocs) promoter . 

Useful inducible promoters include heat -shock 
promoters (Ou-Lee et al . (1986) Proc . Natl. Acad. Sci . 
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USA 83: 6815; Ainley et al . (1990) Plant Mol . Biol. 14: 
949) , a nitrate-inducible promoter derived from the 
spinach nitrite reductase gene (Back et al . (1991) 
Plant Mol , Biol. 17: 9), hormone -inducible promoters 
5 (Yamaguchi-Shinozaki et al . (1990) Plant Mol . Biol. 15: 
905; Kares et al . (1990) Plant Mol . Biol. 15: 905), and 
light -inducible promoters associated with the small 
subunit of RuBP carboxylase and LHCP gene families 
(Kuhlemeier et al . (1989) Plant Cell 1: 471; Feinbaum 

10 et al. (1991) Mol . Gen. Genet. 226: 449; Weisshaar et 
al. (1991) EMBO J. 10: 1777; Lam and Chua (1990) 
Science 248: 471; Castresana et al . (1988) EMBO J. 7: 
1929; Schulze-Lef ert et al . (1989) EMBO J. 8: 651). 
Examples of useful tissue-specific, 

15 developmental ly- regulated promoters include fruit- 
specific promoters such as the E4 promoter (Cordes et 
al . (1989) Plant Cell 1:1025), the E8 promoter (Deikman 
et al . (1988) EMBO J. 7: 3315), the kiwifruit actinidin 
promoter (Lin et al . (1993) PNAS 90: 5939), the 2A11 

20 promoter (Houck et al . , U.S. Patent 4,943,674), and the 
tomato pZ130 promoter (U.S. Patents 5,175, 095 and 
5,530,185); the p-conglycinin 7S promoter (Doyle et al . 

(1986) J. Biol. Chem. 261: 9228; Slighton and Beachy 

(1987) Planta 172: 356), and seed-specific promoters 
25 (Knutzon et al . (1992) Proc . Natl. Acad. Sci . USA 89: 

2624; Bustos et al . (1991) EMBO J. 10: 1469; Lam and 
Chua (1991) J". Biol. Chem. 266: 17131; Stayton et al . 
(1991) Aust. J. Plant. Physiol. 18: 507). Fruit- 
specific gene regulation is discussed in U.S. Patent 
30 5,753,475. Other useful seed-specific promoters 

include, but are not limited to, the napin, phaseolin, 
zein, soybean trypsin inhibitor, 7S, ADR12, ACP, 
stearoyl-ACP desaturase, oleosin, Lasquerella 
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hydroxylase, and barley aldose reductase promoters 
(Bartels (1995) Plant J, 7: 809-822), the EA9 promoter 
(U.S. Patent 5,42 0,034), and the Bce4 promoter (U.S. 
Patent 5,530,194). Useful embryo- specif ic promoters 
include the corn globulin 1 and oleosin promoters. 
Useful endosperm-specific promoters include the rice 
glutelin-1 promoter, the promoters for the low-pl 
amylase gene (Amy32b) (Rogers et al . (1984) J. Biol. 
Chem. 259: 12234), the high-pl -amylase gene (Amy 64) 
(Khurseed et al . (1988) J. Biol. ChGiri. 263: 18953), and 
the promoter for a barley thiol protease gene 
("Aleurain") (Whittier et al . (1987) Nucleic Acids Res. 
15 : 2515) . 

Appropriate target tissues of plants for enhanced 
production of sterol compounds such as sitosterol, 
sitosterol esters, sitostanol, sitostanol esters, and 
tocopherols, and reduced production of campesterol, 
campestanol, and esters thereof, include, but are not 
limited to, fruits, flowers, seeds, roots, tubers, 
leaves, stems, buds, and other vegetable parts of 
plants. Within seeds, appropriate organ compartments 
include the embryo, the endosperm, and the aleurone 
layer. Within any of the noted target tissues, 
appropriate cellular compartments include, but are not 
limited to, the cell cytoplasm and plastids (e.g., 
proplastids , chloroplast s , chromoplasts , leucoplast s , 
amyloplasts, etc.) . 

B . Vectors 

In plants, transformation vectors capable of 
introducing encoding DNAs involved in sterol compound 
and tocopherol biosynthesis are easily designed, and 
generally contain one or more DNA coding sequences of 
interest under the transcriptional control of 5' and 3' 
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regulatory sequences. Such vectors generally comprise, 
operatively linked in sequence in the 5' to 3' 
direction, a promoter sequence that directs the 
transcription of a dovmstream heterologous structural 
DNA in a plant; optionally, a 5' non- translated leader 
sequence; a nucleotide sequence that encodes a protein 
of interest; and a 3* non- translated region that 
encodes a polyadenylation signal which functions in 
plant cells to cause the termination of transcription 
and the addition of polyadenylate nucleotides to the 3 ' 
end of the mRNA encoding the protein. Plant 
transformation vectors also generally contain a 
selectable marker. Typical 5 '-3* regulatory sequences 
include a transcription initiation start site, a 
ribosome binding site, an RNA processing signal, a 
transcription termination site, and/or a 
polyadenylation signal . Vectors for plant 
transformation have been reviewed in Rodriguez et al . 

(1988) Vectors: A Survey of Molecular Cloning Vectors 
and Their Uses, Butterworths , Boston; Glick et al . 

(1993) Methods in Plant Molecular Biology and 
Biotechnology CRC Press, Boca Raton, Fla; and Croy 

(1993) In Plant Molecular Biology Lab fax, Karnes and 
Rickwood, Eds., BIOS Scientific Publishers Limited, 
Oxford, UK. 

The use of transit peptides, e.g. translational 
fusion peptides, are not preferred for use in 
conjunction with the enzymes of the present invention, 
where the sterol synthethic compounds are present 
primarily in the cellular cytosol . 

V. Cell Transformation and Plant Regeneration 
The amount of a gene coding for a polypeptide 
having HMG-CoA reductase activity is increased by 
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transforming a desired plant cell with a suitable 
vector that contains that added exogenous structural 
gene. Expression of that gene in the transformed plant 
cell and transgenic plants developed from that 
5 transformed plant cell enhances the activity of HMG-CoA 
reductase . 

Methods for transforming polypeptide -coding genes 
into plant cells include AgrroJbacteri urn- mediated plant 
transformation, protoplast transformation, gene 
10 transfer into pollen, injection into reproductive 

organs and injection into immature embryos. Each of 
these methods has distinct advantages and 
disadvantages. Thus, one particular methods of 
introducing genes into a particular plant strain may 
15 not necessarily be the most effective for another plant 
strain, but it is well known which methods are useful 
l_ for a particular plant strain. 

7 i 

m Agrrojbacteriuzn- mediated transfer is a widely 

applicable system for introducing genes into plant 
20 cells because the DNA can be introduced into whole 
plant tissues, thereby bypassing the need for 
regeneration of an intact plant from a protoplast. The 
use of Agrrobacteriiim-mediated plant integrating vectors 
to introduce DNA into plant cells is well known in the 
25 art. See, for example, the methods described by Fraley 
et al.. Biotechnology, 3:629 (1984) and Rogers et al . , 
Methods in Enzymology, 153:253-277 (1987). Further the 
integration of the T8-DNA is a relatively precise 
process resulting in few rearrangements. The region of 
30 DNA to be transferred is defined by the border 

sequences, and intervening DNA is usually inserted into 
the plant genome as described by Spielmann et al . , Mol . 
Gen. Genet., 2 05:34 (1986) and Jorgensen et al , Mol. 
Gen, Genet., 207:471 (1987). 
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Modern Agrobacterium transformation vectors are 
capable of replication in E. coli as well as 
AgrroJbacteriuin, allowing for convenient manipulations as 
described by Klee et al . , in Plant DNA Infectious 
Agents, T. Hohn and J. Schell, eds . , Springer-Verlag, 
New York (1985) pp. 179-203. 

Moreover, recent technological advances in vectors 
for Agrrojbacterium-mediated gene transfer have improved 
the arrangement of genes and restriction sites in the 
vectors to facilitate construction of vectors capable 
of expressing various polypeptide coding genes. The 
vectors described by Rogers et al . , Meth. Enzymol , , 
153:253 (1987), have convenient multi-linker regions 
flanked by a promoter and a polyadenylation site for 
directed expression inserted polypeptide conding genes 
and are suitable for present purposes. 

In addition, Agrobacteria containing both armed 
and disarmed Ti genes can be used for the 
transformations. Both types of transforming systems 
are illustrated herein. Transf ormants from the former 
system result in callus from which the desired squalene 
or sterol can be obtained, whereas transf ormants 
obtained from the latter, disarmed Ti genes can be 
regenerated into complete transgenic plants from whose 
tissues, e.g. leaf, seed and root, the desired 
chemicals can be obtained. 

In those plant strains where Agrobacterium- 
mediated transformation is efficient, it is the method 
of choice because of the facile and defined nature of 
the gene transfer. 

Agrobac terium-mediated transformation of leaf 
disks and other tissues, such as cotyledons and 
hypocotyls, appears to be limited to plant strains that 
Agrojbacateriuin naturally infects. AgrroJbacteriu/n- 
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mediated transformation is the most efficient in 
dicotyledonous plants. Few monocots appear to be 
natural hosts for Agrobacterium, although transgenic 
plants have been produced in the monocot, asparagus, 
using Agrobacterium vectors as described by Bytebier et 
al., Proc. Natl. Acad. Sci . USA, 84:5345 (1987). 
Therefore, commercially important cereal grains such as 
rice, corn, and wheat must usually be transformed using 
alternative methods. However, as mentioned above, the 
transformation of asparagus using Agrobacteriuim can 
also be achieved. 

A transgenic plant formed using AgrroJbacteriuin 
transformation methods typically contains a single gene 
on one chromosome. Such transgenic plants can be 
referred to as being heterozygous for the added gene. 
However, inasmuch as use of the word "heterozygous" 
usually implies the presence of a complementary gene at 
the same locus of the second chromosome of a pair of 
chromosomes, and there is no such gene in a plant 
containing one more than one added gene as here, it is 
believed that a more accurate name for such a plant is 
an independent segregant, because- the added, exogenous 
gene segregates independently during mitosis and 
meiosis. A transgenic plant containing a single 
structural gene that encodes a polypeptide having HMG- 
CoA reductase activity and at least one of the 
enumerated 6 steroid pathway enzymes; i.e., and 
independent segregant, is a preferred transgenic plant. 

More preferred is a transgenic plant that is 
homozygous for the added structural gene; i.e., a 
transgenic plant that contains two added genes, one 
gene at the same locus on each chromosome of a 
chromosome pair. A homozygous transgenic plant can be 
obtained by sexually mating (selfing) an independent 
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segregant transgenic plant that contains the added 
genes according to the invention, germinating some of 
the seed produced and analyzing the resulting plants 
produced for enhanced HMG-CoA reductase activity, 
steroid pathway product accumulation or both, relative 
to a control (native, non- transgenic) or an independent 
segregant transgenic plant . 

A homozygous transgenic plant exhibits enhanced 
HMG-CoA reductase activity as compared to a native, 
non- transgenic plant and an independent segregant 
transgenic plant. 

It is to be understood that two different 
transgenic plants can also be mated to produce 
offspring that contain two independently segregating 
added, exogenous genes. Selfing of appropriate progeny 
can produce plants that are homozygous for both added, 
exogenous genes that encode a polypeptide having HMG- 
CoA reductase activity. Back-crossing to a parental 
plant and outcrossing with a non- transgenic plant are 
also contemplated. 

A. Host Cells and Transformed Plant Cells 
Cells modified according to the present invention 
are contemplated at each stage of the processes of the 
invention. As a result of the invention comprising at 
least two genes, there are several means to accomplish 
that end. In some embodiments of the invention, the 
intermediate steps include transformation of nucleic 
acids comprising some or all of the genes into host 
cells . 

The nucleic acid sequence encoding a polypeptide 
exhibiting HMGR activity does not have to be in the 
same orientation as a nucleic acid sequence encoding a 
polypeptide exhibiting the activity of a steroid 
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pathway enzyme. The coding nucleic acids can be under 
the control of different promoters or be in different 
orientations. For the host plant cell useful in 
carrying out the steroid compound biosynthesis 
according to the invention, the minimum that is 
required is the coding nucleic acids be in the same 
host cell. As long as the HMGR and a steroid pathway 
enzyme coding sequences are present in the same host 
cell, they do not have to be on the same DNA molecule 
or under the control of the same promoter, nor do they 
have to be derived from the same vector or construct. 

Host cells are useful for making, storing, 
reproducing or manipulating nucleic acid constructs of 
the invention. Contemplated host cells are eukaryotic 
cells, such as yeast or plant cells. Any plant cells 
can be utilized with the present invention. Some 
particularly useful agriculturally significant plant 
cells are canola, soybean, corn, maize, tobacco, 
cotton, rape, tomato and alfalfa. Other common plant 
varieties are carrot, barley, arabidopsis, guayule and 
petunia. Prokaryotic host cells containing constructs 
and/or vectors according to the invention are also 
contemplated (e.g. E. coli) . 

One embodiment of the invention is a transformed 
host cell containing inter alia a recombinant construct 
that encodes both a DNA sequence encoding a polypeptide 
exhibiting 3 -hydroxy- 3 -me thylglutaryl -Coenzyme A 
reductase enzyme activity and a DNA sequence encoding a 
steroid pathway enzyme, where the steroid pathway 
enzyme is as described in detail above. In a preferred 
embodiment, those coding DNA sequences are operably 
linked to a promoter and a transcription termination 
signal sequence. In the coding sense direction of the 
construct, the components of the construct are operably 
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linked in the 5' to 3 ' direction as a promoter, the DNA 
sequence encoding sequence and a transcription 
termination signal sequence. 

Another embodiment of the invention is host cell 
that has been transformed with a recombinant vector 
that has such a construct. As discussed herein, in one 
embodiment of the invention, such a recombinant vector 
is a plant expression vector. Preferably such a host 
cell is a plant cell. 

Methods of culturing various eukaryotic and 
prokaryotic cell cultures are well known in the art. 
The present invention contemplates cell cultures of 
transformed host cells. Transformed plant cells 
include transformed protoplasts and other types of host 
cell intermediates as well as plant cell cultures. 

Non-limiting examples of plants that can be used 
in the practice of the invention include, acacia, 
alfalfa, aneth, apple, apricot, artichoke, arugula, 
asparagus, avocado, banana, barley, beans, beet, 
blackberry, blueberry, broccoli, brussel sprouts, 
cabbage, canola, cantaloupe, carrot, cassava, 
cauliflower, celery, cherry, chicory, cilantro, citrus, 
Clementines, coffee, corn, cotton, cucumber, Douglas 
fir, eggplant, endive, escarole, eucalyptus, fennel, 
figs, garlic, gourd, grape, grapefruit, honey dew, 
jicama, kiwifruit, lettuce, leeks, lemon, lime. 
Loblolly pine, mango, melon, mushroom, nectarine, nut, 
oat, oil palm, oil seed rape, okra, onion, orange, 
ornamental plants, papaya, parsley, pea, peach, peanut, 
pear, pepper, persimmon, pine, pineapple, plantain, 
plum, pomegranate, poplar, potato, pumpkin, quince, 
radiata pine, radicchio, radish, raspberry, rice, rye, 
sorghum. Southern pine, soybean, spinach, squash, 
strawberry, sugarbeet , sugarcane, sunflower, sweet 
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potato, sweetgum, tangerine, tea, tobacco, tomato, 
triticale, turf, turnip, watermelon, wheat, yam, and 
zucchini . 

Plants particularly attractive for the steroid 
pathway* modifications described herein include those 
that produce carbon substrates which can be employed 
for synthesis of these compounds. Non-limiting 
examples of such plants include various monocots and 
dicots, including high oil seed plants such as high oil 
seed Brassica (e.g., Brassica nigra, Brassica napus, 
Brassica hirta, Brassica rapa, Brassica campestris , 
Brassica carinata, and Brassica juncea) , soybean 
{GlycinG wax) , castor bean {Ricinus cowmunis) , cotton, 
saf flower {Carthamus tinctorius) , sunflower (Helianthus 
annuus) , flax {Linuw usitatissiwum) , corn (Zea mays) , 
coconut {Cocos nucifera) , palm {Elaeis guineensis) , 
oilnut trees such as olive (Olea europaea) , sesame, and 
peanut {Arachis hypogaea) , as well as Arabidopsis, 
tobacco, wheat, barley, oats, amaranth, potato, rice, 
tomato, and legumes (e.g., peas, beans, lentils, 
alfalfa, etc . ) . 

Enhancement of sitostanol compound production by 
the methods discussed herein is expected to result in 
yields of sitostanol, sitostanol esters, or mixtures 
thereof in an amount of at least about 57% by weight, 
preferably from about 57% to about 90% by weight, and 
more preferably from about 57% to about 65% by weight 
of the total sterol compounds present in seed oil. 
Expressed on a seed dry weight basis, sitostanol, 
sitostanol esters, or mixtures thereof are expected to 
be present in an amount of at least about 0.08%, 
preferably from about 0.08% to about 0.8%, and more 
preferably from about 0.08% to about 0.4% of seed dry 
weight . 
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B . Processes of Transformation 

A variety of different methods can be employed to 
introduce transformation/expression vectors into plant 
protoplasts, cells, callus tissue, leaf discs, 
meristems, etc., to generate transgenic plants. These 
methods include, for example, Agrojbacteriuin-mediated 
transformation, particle gun delivery, microinjection, 
electroporation, polyethylene glycol -mediated 
protoplast transformation, liposome -mediated 
transformation, etc. (reviewed in Potrykus (1991) Ann. 
Rev. Plant Physiol. Plant Mol . Biol. 42: 205). 

In general, transgenic plants comprising cells 
containing and expressing nucleic acids encoding 
enzymes facilitating the modifications in sterol 
compound and tocopherol biosynthesis and accumulation 
described herein can be produced by transforming plant 
cells with a DNA construct as described above via any 
of the foregoing methods; selecting plant cells that 
have been transformed on a selective medium; 
regenerating plant cells that have been transformed to 
produce differentiated plants; and selecting a 
transformed plant that expresses the enzyme -encoding 
nucleotide sequence (s) at a level such that the amount 
of sitosterol, sitosterol esters, sitostanol, 
sitostanol esters, tocopherol compound(s), and 
campesterol/campestanol and their esters is within the 
ranges described herein. 

The encoding DNAs can be introduced either in a 
single transformation event (all necessary DNAs present 
on the same vector) , a co-transformation event (all 
necessary DNAs present on separate vectors that are 
introduced into plants or plant cells simultaneously) , 
or by independent transformation events (all necessary 
DNAs present on separate vectors that are introduced 
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into plants or plant cells independently) . Traditional 
breeding methods can subsequently be used to 
incorporate the desired combination of enzymes into a 
single plant, and to produce hybrid progeny of the 
invention plants. 

Specific methods for transforming a wide variety 
of dicots and obtaining transgenic plants are well 
documented in the literature (Gasser and Fraley (1989) 
Science 244: 1293; Fisk and Dandekar (1993) Scientia 
Horticulturae 55: 5; Christou (1994) Agro Food Industry 
Hi Tech, p. 17; and the references cited therein) . 

Apomixis is a genetically controlled method of 
reproduction in plants where the embryo is formed 
without union of an egg and a sperm. There are three 
basic types of apomictic reproduction: 1) apospory 
where the embryo develops from a chromosomal ly 
unreduced egg in an embryo sac derived from the 
nucellus, 2) diplospory where the embryo develops from 
an unreduced egg in an embryo sac derived from the 
megaspore mother cell, and 3) adventitious embryony 
where the embryo develops directly from a somatic cell . 
In most forms of apomixis, psuedogamy or fertilization 
of the polar nuclei to produce endosperm is necessary 
for seed viability. In apospory, a "nurse" cultivar 
can be used as a pollen source for endosperm formation 
in seeds. The nurse cultivar does not affect the 
genetics of the aposporous apomictic cultivar since the 
unreduced egg of the cultivar develops 
parthenogenetically , but makes possible endosperm 
production . 

Apomixis is economically important, especially in 
transgenic plants, because it causes any genotype, no 
matter how heterozygous, to breed true. Thus, with 
apomictic reproduction, heterozygous transgenic plants 
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can maintain their genetic fidelity throughout repeated 
life cycles. Methods for the production of apomictic 
plants are known in the art. See, U.S. Patent No. 
5,811,636 and references cited therein which are herein, 
incorporated by reference. 

The present invention also encompasses uniform 
populations of any of the plants discussed herein. 

Successful transformation and plant regeneration 
have been achieved in the monocots as follows : 
asparagus {Asparagus officinalis; Bytebier et al . 
(1987) Proc, Natl. Acad. Sci. USA 84: 5345); barley 
{Hordex2m vulgarae; Wan and Lemaux (1994) Plant Physiol. 
104: 37); maize (Zea mays; Rhodes et al . (1988) Science 
240: 204; Gordon-Kamm et al . (1990) Plant Cell 2: 603; 
Fromm et al . (1990) Bio/Technology 8: 833; Koziel et 
al. (1993) Bio/Technology 11: 194); oats {Avena sativa; 
Somers et al . (1992) Bio/Technology 10: 1589); 
orchardgrass {Dactylis glomerata; Horn et al . (1988) 
Plant Cell Rep. 7: 469); rice {Oryza sativa, including 
indica and japonica varieties; Toriyama et al . (1988) 
Bio/Technology 6: 10; Zhang et al . (1988) Plant Cell 
Rep. 7: 379; Luo and Wu (1988) Plant Mol . Biol. Rep. 6: 
165; Zhang and Wu (1988) Theor. Appl . Genet. 16: 835; 
Christou et al . (1991) Bio/Technology 9: 957); rye 
{Secale cereale; De la Pena et al . (1987) Nature 325: 
274); sorghum (Sorghum bicolor; Cassas et al . (1993) 
Proc. Natl. Acad. Sci. USA 90: 11212); sugar cane 
{Saccharum spp. ; Bower and Birch (1992) Plant J. 2: 
409); tall fescue {Festuca arundinacea; Wang et al . 
(1992) Bio/Technology 10: 691); turfgrass {Agrostis 
palustris; Zhong et al . (1993) Plant Cell Rep. 13: 1); 
and wheat {Triticum aestixnmi; Vasil et al . (1992) 
Bio/Technology 10: 667; Weeks et al . (1993) Plant 
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Physiol. 102: 1011 ; Becker et al . (1994) Plant J. 5: 
299) . 

Plant transformation vectors capable of delivering 
DNAs (genomic DNAs, plasmid DNAs, cDNAs, or synthetic 
DNAs) encoding plant -derived or other enzymes that 
affect the biosynthesis and accumulation of sterol 
compounds and tocopherols in plants for optimizing the 
pools of sitosterol, sitostanol, esters of either, and 
tocopherols, and for reducing the levels of 
campesterol, campestanol, and/or their esters, can be 
easily designed by art-recognized methods. Various 
strategies can be employed to introduce these encoding 
DNAs into plants to produce transgenic plants that 
biosynthesize and accumulate desirable levels of 
various sterol compounds and tocopherols, including: 

1 . Transforming individual plants with an 
encoding DNA of interest. Two or more transgenic 
plants, each containing one of these DNAs, can then be 
grown and cross-pollinated so as to produce hybrid 
plants containing the two DNAs. The hybrid can then be 
crossed with the remaining transgenic plants in order 
to obtain a hybrid plant containing all DNAs of 
interest within its genome. 

2. Sequentially transforming plants with plasmids 
containing each of the encoding DNAs of interest, 
respectively . 

3 . Simultaneously cotransf orming plants with 
plasmids containing each of the encoding DNAs, 
respectively . 

4. Transforming plants with a single plasmid 
containing two or more encoding DNAs of interest . 

5. Transforming plants by a combination of any of 
the foregoing techniques in order to obtain a plant 
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that expresses a desired combination of encoding DNAs 
of interest. 

Traditional breeding of transformed plants 
produced according to any one of the foregoing methods 
by successive rounds of crossing can then be carried 
out to incorporate all the desired encoding DNAs in a 
single homozygous plant line (Nawrath et al . (1994) 
Proc, Natl. Acad. Sci . USA 91: 12760; PCT International 
Publication WO 93/02187) , or to produce hybrid 
offspring. 

In methods 2 and 3, the use of vectors containing 
different selectable marker genes to facilitate 
selection of plants containing two or more different 
encoding DNAs is advantageous. Examples of useful 
selectable marker genes include those conferring 
resistance to kanamycin, hygromycin, sulphonamides , 
glyphosate, bialaphos, and phosphinothricin . 

C . Processes of Recreneration 
Processes of regeneration of plants from 

transformed protoplasts are known in the art. 

D . Transgenic Plants and Progeny 

The present invention contemplates the plants that 
contain the exogenous constructs according to the 
present invention, such that a plant comprises at least 
one transformed plant cell comprising a nucleic acid 
construct. The nucleic acid construct, as described 
above has as operably linked components in the 5* to 3' 
direction, a promoter, a DNA sequence encoding a 
polypeptide exhibiting 3 -hydroxy-3 -methylglutaryl - 
Coenzyme A reductase enzyme activity, and a 
transcription termination signal sequence. The plant 
also comprises a nucleic acid construct that has as 
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operably linked components in the 5 ' to 3 ' direction, a 
promoter, a DNA sequence encoding a steroid pathway- 
enzyme, and a transcription termination signal 
sequence. The steroid pathway enzyme is a squalene 
epoxidase enzyme, a sterol methyl transferase I enzyme, 

a sterol C4 -demethylase enzyme, a obtusifoliol C14a- 
demethylase enzyme, a sterol C5-desaturase enzyme, or a 
sterol methyl transferase II enzyme. In one 
embodiment, the nucleic acid constructs are recombinant 
constructs . 

In one embodiment of the present invention a 
transgenic plant can be produced in accordance with the 
processes discussed elsewhere herein. One method to 
arrive at the above construct-containing plant is to 
transform the plant cell with a recombinant vector 
harboring such a construct . Other methods involve 
direct transfer of the exogenous construct into the 
plant cell. The methods of arriving at a plant cell 
having exogenous nucleic acids are well known in art 
and are applicable to the present invention. In one 
embodiment, the nucleic acid constructs are recombinant 
constructs. In a preferred embodiment, the recombinant 
vector is a plant expression vector. 

The present invention contemplates a plant, the 
genome of which comprises introduced DNA. That 
introduced DNA has at least two components. One 
component is a DNA encoding a 3-hydroxy-3- 
methyulglutaryl -Coenzyme A reductase enzyme. The other 
component is DNA encoding a steroid pathway enzyme that 
is a squalene epoxidase enzyme, a sterol methyl 
transferase I enzyme, a sterol C4 -demethylase enzyme, a 

obtusifoliol C14a-demethylase enzyme, a sterol C5- 
desaturase enzyme, or a sterol methyl transferase II 
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enzyme. The storage organs, preferably seeds, of such 
a plant contain an elevated level of total accumulated 
sterol, compared to storage organs of an otherwise 
identical plant, the genome of which does not comprise 
the introduced DNA. The introduced DNAs are 
operatively linked to regulatory signals, preferably 
that cause seed- specif ic expression of said introduced 
DNAs. The seeds of such a plant contain a reduced 
level of squalene, cycloartenol , 24 -methylene 
cycloartenol , obtusif oliol , stigmasta-7 -enol , or 
campesterol compared to the seeds of an otherwise 
identical plant or compared to a plant comprising an 
introduced DNA encoding a HMG CoA reductase enzyme 
without the contemplated steroid pathway enzyme. 

Also contemplated is a plant with introduced DNA, 
as described above, that produces seed having an 
elevated level of a steroid pathway product, compared 
to a corresponding transgenic or non- transgenic plant 
that does not contain said introduced DNA, 

The invention also contemplates a plant comprising 
introduced DNA encoding (i) an HMGR enzyme and (ii) a 
squalene epoxidase enzyme, a sterol methyl transferase 
I enzyme, a sterol C4 -demethylase enzyme, a 

obtusif oliol C14a-demethylase enzyme, a sterol C5- 
desaturase enzyme, a sterol methyl transferase II 
enzyme, or mixtures thereof, wherein said plant that 
produces a storage organ (preferably a seed) having an 
elevated level of a sterol pathway product compared to 
a corresponding transgenic or non- transgenic plant that 
does not contain said introduced DNA. 

The invention also contemplates a plant having 
introduced DNA, as described above, that produces a 
storage organ (preferably a seed) having a reduced 
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level of squalene, cycloartenol , 24 -methylene 
cycloartenol , obtusif oliol , stigmasta-7-enol , 
campesterol , or mixtures thereof, compared to a 
corresponding transgenic plant that comprises 
introduced DNA encoding an HMGR enzyme but that does 
not contain introduced DNA encoding a squalene 
epoxidase enzyme, a sterol methyl transferase I enzyme, 

a sterol C4 -demethylase enzyme, a obtusif oliol C14a- 
demethylase enzyme, a sterol C5-desaturase enzyme, a 
sterol methyl transferase II enzyme, or mixtures 
thereof . 

For any of the above plants, an embodiment is 
contemplated wherein the introduced nucleic acid has 
regulatory signals that cause seed-specific expression 
of said introduced DNAs . 

The progeny of the above-described plants are also 
considered an embodiment of the present invention, as 
are plant cells or transformed plant cells. Cultures 
of those plant cells are also contemplated. Plants 
produced from seeds having introduced DNA are also 
embodiments of the present invention. 

A further embodiment of the present invention is a 
method of producing a plant that accumulates an 
elevated level of sterol pathway products, in seed of 
said plant compared to seed of a corresponding plant 
comprising no introduced DNA encoding a peptide, 
polypeptide, or protein that affects the biosynthesis 
and accumulation of a sterol pathway product, 
comprising sexually crossing a plant having introduced 
nucleic acid with the corresponding plant having no 
introduced DNA. Plants, including apomictic plants 
produced by this method are contemplated. 
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Another embodiment is a seed resulting from a 
cross of a plant having introduced DNA, described 
above, with a nurse cultivar. Also contemplated are 
seeds of any of the above-described plants. Also part 
of the present invention plant parts, other than a seed 
of any of the above -described plants. 

Uniform populations of the above -described plants 
are also contemplated. 

E . Stability of Transgene Expression 
As several overexpressed enzymes may be required 
to produce optimal levels of substrates for the 
biosynthesis of sterol compounds and tocopherols, the 
phenomenon of co- suppression may influence transgene 
expression in transformed plants. Several strategies 
can be employed to avoid this potential problem 
(Finnegan and McElroy (1994) Bio/Technology 12: 883). 

One commonly employed approach is to select and/or 
screen for transgenic plants that contain a single 
intact copy of the transgene or other encoding DNA 
(Assaad et al . (1993) Plant Mol . Biol. 22: 1067; 
Vaucheret (1993) C.R, Acad. Sci. Paris, Science de la 
vie/Life Sciences 316: 1471; McElroy and Brettell 
(1994) TIBTECH 12: 62), AgrroJbacteri urn -media ted 
transformation technologies are preferred in this 
regard . 

Inclusion of nuclear scaffold or matrix attachment 
regions (MAR) flanking a transgene has been shown to 
increase the level and reduce the variability 
associated with transgene expression in plants (Stief 
et al. (1989) Nature 341: 343; Breyne et al . (1992) 
Plant Cell 4: 463; Allen et al . (1993) Plant Cell 5: 
603); Mlynarova et al . (1994) Plant Cell 6: 417; Spiker 
and Thompson (1996) Plant Physiol. 110: 15). Flanking a 
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transgene or other encoding DNA with MAR elements may 
overcome problems associated with differential base 
composition between such transgenes or encoding DNAs 
and integrations sites, and/or the detrimental effects 
5 of sequences adj acent to transgene integration sites . 

The use of enhancers from tissue-specific or 
developmentally-regulated genes may ensure that 
expression of a linked transgene or other encoding DNA 
occurs in the appropriately regulated manner. 
10 The use of different combinations of promoters, 

plastid targeting sequences , and selectable markers for 
y introduced transgenes or other encoding DNAs can avoid 

W potential problems due to trans- inacti vat ion in cases 

fr^ where pyramiding of different transgenes within a 

15 single plant is desired. 

Finally, inactivation by co-suppression can be 
avoided by screening a number of independent transgenic 
plants to identify those that consistently overexpress 
particular introduced encoding DNAs (Register et al . 
20 (1994) Plant Mol, Biol. 25: 951). Site-specific 

recombination in which the endogenous copy of a gene is 
replaced by the same gene, but with altered expression 
characteristics, should obviate this problem (Yoder and 
Goldsbrough (1994) Bio/Technology 12: 263). 
25 Any of the foregoing methods, alone or in 

combination, can be employed in order to insure the 
stability of transgene expression in transgenic plants 
of the present invention. 
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30 F . Hybrid Plants 

The invention contemplates a plant having 
introduced DNA encoding an HMGR and at least one of the 
six steroid pathway enzymes, as described in detail 
above. It is contemplated that a transgenic plant 
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having DNA encoding an HMGR, as is known in the art, 
might be crossed with a transgenic plant having DNA 
encoding at least one of the six steroid pathway 
enzymes . 

Also contemplated as a hybrid plant according to 
the invention is a plant that is a hybrid of a 
transgenic plant having introduced DNA encoding an HMGR 
and at least one of the six steroid pathway enzymes 
wherein the plant has been hybridized with another 
strain, yet still retains the introduced DNA. 

G . Storage Organs 

The term "storage organ" as used herein, refers to 
the seeds, fruits or vegetable parts of a plant. Most 
often the seed is important for use in the present 
invention. However, there are consumable embodiments, 
such as with potatoes or carrots, where the vegetable 
parts are preferred. 

A contemplated embodiment of the present invention 
is a storage organ comprising at least one transformed 
host cell. The transformed host cell has at a minimum 
a construct according the invention as described above. 
Also contemplated are the embodiments when the 
construct has plant promoters, when the construct is 
recombinant, when the construct is part of a vector, 
and when the vector is a plant expression vector. 

The invention contemplates a transgenic plant seed 
transformed with a vector comprising a DNA segment that 
encodes a polypeptide having HMG-CoA reductase 
activity, and a DNA segment that encodes a polypeptide 
having a steroid pathway enzyme, wherein the transgenic 
plant seed is capable of germinating into a transgenic 
plant that over-accumulates steroid pathway products 
relative to a native, non- transgenic plant of the same 
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strain; and mutants, recombinants, genetically 
engineered derivatives thereof and hybrids derived 
therefrom, wherein said mutants, recombinants, 
genetically engineer derivatives thereof and hybrids 
derived therefrom maintain the ability to 
overaccumulate steroid pathway products. 

Seed from a transgenic plant is grown in the field 
or greenhouse, and resulting sexually mature transgenic 
plants are self -pollinated to generate true breeding 
plants. The progeny from these plants become true 
breeding lines that are evaluated for steroid compound 
or squalene accumulation, preferably in the field under 
a range of environmental conditions. 

The commercial value of a transgenic plant with 
increased steroid compound or squalene accumulation is 
enhanced if many different hybrid combinations are 
available for sale. The user typically grows more than 
one kind of hybrid based on such differences as time to 
maturity, standability or other agronomic traits. 
Additionally, hybrids adapted to one part of a country 
are not necessarily adapted to another part because of 
differences in such traits as maturity, disease and 
herbicide resistance. Because of this, steroid 
compound or squalene accumulation is preferably bred 
into a large number of parental lines so that many 
hybrid combinations can be produced. 

Adding an enhanced steroid compound or squalene 
accumulation trait to an agronomically elite line is 
accomplished by a variety of techniques well known to 
those of skill in the art. For example, parent 
transgenic plants that are either homozygous or contain 
a single independent segregatable gene that encodes a 
polypeptide having HMG-CoA activity and thus for 
enhanced sterol or squalene accumulation are crossed 
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with lines having other desirable traits such as 
herbicide resistance (U.S. Pat. No. 4,761,373) produce 
hybrids. Preferably, transgenic plants homozygous for 
enhanced sterol or squalene accumulation are used to 
generate hybrids . 

For example, a transgenic plant homozygous for 
enhance sterol accumulation is crossed with a parent 
plant having other desired traits. The progeny, which 
are heterozygous or independently segregatable for 
enhanced sterol accumulation and their other desired 
traits. The backcrossing of progeny with the parent 
may have to be repeated more than once to obtain a 
transgenic plant that possesses all desirable traits. 

Alternatively, transgenic plants with an enhanced 
sterol or squalene accumulation trait are made multiply 
transgenic by introducing into such plants other genes 
that encode and express other desirable traits, or are 
mutated as with radiation, e.g. X-rays or gamma rays, 
as in U.S. Pat. No. 4,616,099, whose disclosures are 
incorporated by reference. Thus, the present invention 
also contemplates mutants and genetically engineered 
derivatives of transgenic plants having enhanced sterol 
or squalene accumulation. 

VI . Harvest 

Besides seed,, elevated levels of sterols, 
phytosterols , such as sitosterol, phytostanols , such as 
sitostanol, and esters thereof, can be found in other 
parts of the plants encompassed herein. While the 
seed- specif ic promoters contemplated in the present 
invention function preferentially in seed tissues, 
expression in other plant parts can be expected, 
depending upon the specificity of the particular 
promoter. In this case, promoters functional in plant 
plastids are less desirable than those primarily 
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directing expression in the cellular cytosol, though it 
may be desirable to use promoters to drive expression 
of the recombinant constructs or expression cassettes 
disclosed herein in tissues and organs other than 
5 seeds. For example, elevated levels of sterols, 

phytosterols , etc., can be expected in fruits, as well 
as vegetable parts of plants other than seeds. 
Vegetable parts of plants include, for example, pollen, 
inflorescences, terminal buds, lateral buds, stems, 
10 leaves, tubers, and roots. Thus, the present invention 
also encompasses these and other parts of the plants 
disclosed herein that contain elevated levels of 
desirable phytosterol, and phytostanol . 
fl Of course, a significant effect of introducing 

15 into plants the coding sequences disclosed herein will 
be on the content of phytosterols/phytostanols and 
their esters of seed oil. Therefore, additional 
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fu aspects of the present invention include oil obtainable 

from the seed of the plants described herein, and 
20 methods for producing such plants and oil. Methods for 
extracting and processing seed oils are well known in 
the art . 

Oils produced by the cells, plants, and methods 
disclosed herein are superior in 
25 phytosterol/phytostanol composition to conventional 
oils in a variety of ways. Oil of the present 
invention can contain an elevated level of at least one 
sterol, at least one phytosterol, at least one 
phytosterol ester, at least one phytostanol, at least 
30 one phytostanol ester, or mixtures thereof. Preferred 
compounds include sitosterol, sitostanol, and their 
esters . 

Oil from seed of plants containing and expressing 
introduced DNA encoding a sterol methyl transferase II 
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enzyme advantageously contains a reduced level of 
campesterol , at least one campesterol ester, 
campestanol , at least one campestanol ester, or 
mixtures thereof. The sterol methyltransf erase II- 
5 encoding DNA can be introduced alone, or in combination 
with other introduced DNA sequences encoding enzymes 
affecting the biosynthesis of steroid compounds as 
discussed herein. Campesterol/campestanol and their 
esters are considered to be undesirable because they 

10 are readily absorbed in the intestine, while their 

safety in the blood is unknown. Employing the plants 
and methods disclosed herein, one can obtain seed oil 
comprising about 0% to about 19%, preferably about 0% 
to about 12%, more preferably about 5% to about 9% 

15 campesterol, at least one campesterol ester, 

campestanol, at least one campestanol ester, or 
mixtures thereof by weight of the total sterol 
compounds of the oil. (The levels of these compounds 
are difficult to express on a percent seed dry weight 

20 basis because different seeds contain different 

percentages of these compounds expressed on this basis) 
These values represent a reduction of about 10% to 
about 100% in the amount of these compounds compared to 
those in conventional oils. 

25 Introduction into plant cells of the enzyme- 

encoding DNA sequences discussed above modifies the 
biosynthesis of sterol compounds carried out by the 
methods, and in the cells, plants, and seeds, disclosed 
herein. In particular, the expression of an HMG CoA 

30 reductase in conjunction with DNA sequences for a 
steroid pathway enzyme is expected to result in 
alteration of the steroid pathway product profiles in 
oil as the enhanced steroid pathway throughput produces 
substrates for the enhanced enzyme activity. The novel 
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phytostanol ester compositions, e.g., sitostanol ester 
compositions, thus produced constitute another aspect 
of the present invention. 

A. HajTvest of Steroid Compounds 

Methods for the derivation of steroid compounds 
from cells are well known in the art. The invention 
contemplates the recovery of biosynthesized steroid 
compounds from the leaves and/or stems of plants, from 
plant seeds, from plant * s vegetative organs, from 
callouses, and from cell cultures of plants, yeasts or 
eukaryotic cells. 

Different sources of steroid compounds are 
preferred for various plants. For use as a food or a 
food component as discussed later, the steroid 
compounds need not be isolated or purified to 100 
percent purity. Steroid compound- enriched plants may 
be utilized directly. 

For example, from tobacco or Arahidopsis, it may 
be preferable to extract a pulp of the leaves and 
stems. From tomato, potato, or corn, it may be 
preferable to use the tomato, potato or corn in the 
form of the familiar storage organs that are typically 
consumed either directly, or a derivative thereof, such 
as tomato paste, potato flakes, vegetable oil and many 
more that are well known in the food science arts. 

If desired, after cultivation, the transgenic 
plant is harvested to recover the sterol or squalene 
product. This harvesting step can consist of 
harvesting a callus culture, the entire plant, or only 
the leaves, or roots of the plant. This step can 
either kill the plant or, if only a non-essential 
portion of the transgenic plant is harvested, can 
permit the remainder of the plant to continue to grow. 



114 MTC 6783.1 

PATENT 

In preferred embodiments, this harvesting step 
further comprises the steps of: 

(i) homogenizing at least a sterol - 
containing or a squalene-containing 
portion of the transgenic plant to 
produce a plant pulp and using the 
sterol -or squalene-containing pulp 
directly, as in dried pellets or tablets 
as where an animal food is contemplated; 
or 

(ii) extracting the squalene or 
sterol (s) from the plant pulp with an. 
appropriate solvent such as an organic 
solvent or by supercritical extraction 

[Favati, et al . , J". Food Sci . 53:1532 

(1988) and the citations therein] to 
produce a sterol-or squalene-containing 
liquid solution or suspension; and 

(iii) isolating the squalene or 
sterol (s) from the solution or 
suspension . 

At least a portion of the transgenic plant is 
homogenized to produce a plant pulp using methods well 
known to one skilled in the art. This homogenization 
can be done manually, by a machine, or by a chemical 
means as long as the transgenic plant portions are 
broken up into small pieces to produce a plant pulp. 
This plant pulp consists of a mixture of squalene or 
the sterol of interest, residual amounts of precursors, 
cellular particles and cytosol contents. This pulp can 
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be dried and compressed into pellets or tablets and 
eaten or otherwise used to derive the benefits, or the 
pulp can be subjected to extraction procedures. 

The sterol or squalene can be extracted from the 
5 plant pulp produced above to form a sterol -or-squalene- 
containing solution or suspension. Such extraction 
processes are common and well known to one skilled in 
this art. For example, the extracting step can consist 
of soaking or immersing the plant pulp in a suitable 

10 solvent. This suitable solvent is capable of 

dissolving or suspending the squalene or sterol present 
in the plant pulp to produce a sterol -or squalene- 
containing solution or suspension. Solvents useful for 
such an extraction process are well known to those 

15 skilled in the art and included several organic 

solvents and combinations thereof such as methanol, 
ethanol, isopropanol, acetone, acetonitrile , 
tetrahydrof uran (THF) , hexane, and chloroform as well 
as water-organic solvent mixtures. A vegetable oil 

20 such as peanut, corn, soybean and similar oils can also 
be used for this extraction as can steam distillation. 

A whole plant or callus culture with an added, 
exogenous structural gene for a polypeptide having HMG- 
CoA reductase activity is grown under suitable 

25 conditions for a period of time sufficient for squalene 
or sterols to be synthesized and accumulated. The 
sterol -squalene-containing plant cells, preferably in 
dried form, are then lysed chemically or mechanically, 
and the squalene or sterol is extracted from the lysed 

30 cells using a liquid organic solvent or steam 

distillation, as described before, to form a sterol- or 
squalene-containing liquid solution or suspension by 
usual means such as chromatography. 



116 MTC 6783.1 

PATENT 

The squalene or sterol is isolated from the 
solution or suspension produced above using methods 
that are well known to those skilled in the art of 
squalene and sterol isolation. These methods include, 
but are not limited to, purification procedures based 
on solubility in various liquid media, chromatographic 
techniques such as column chromatography and the like. 

The invention contemplates a sitosterol or 
sitostanol ester composition extracted from the seed of 
a transgenic plant of the invention. The invention 
also contemplates such a sitosterol or sitostanol ester 
wherein an esterifying fatty acid has 2 to 22 carbon 
atoms in the main chain. 

B . Harvest of Oil 

The novel biosynthetic composition of the oil in 
the transgenic plants is contemplated. Thus, the 
present invention contemplates oil containing at least 
one sterol pathway product, extracted from seed of a 
described transgenic plant. Preferably, sitosterol, at 
least one sitosterol ester, or mixtures thereof, 
comprise at least about 50% by weight of the total 
sterol compounds of the oil. Preferably, sitosterol, 
at least one sitosterol ester, or mixtures thereof, 
comprise at least about 0.08% of the dry weight of a 
contemplated seed. Preferably, the oil has a reduced 
amount of squalene, cycloartenol , 24-methylene 
cycloartenol , obtusif oliol , stigmasta-7-enol , 
campesterol , or mixtures thereof, compared to oil from 
a corresponding transgenic plant that does not contain 
introduced DNA encoding a squalene epoxidase enzyme, a 
sterol methyl transferase I enzyme, a sterol C4- 

demethylase enzyme, a obtusif oliol C14a-demethylase 
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enzyme, a sterol C5-desaturase enzyme, a sterol methyl 
transferase II enzyme, or mixture thereof, and that 
reduction is in the range of from about 10% to about 
100% . 

5 Oil is extracted from transgenic plant seeds of 

the present invention by method well known in the art. 
By way of example, oil can be extracted from plant 
seeds using extraction methods set forth above for 
harvesting sterols and squalene from transgenic plants. 
10 Alternatively, oil can be extracted from transgenic 
^ plant seeds by usually used methods for obtaining seed 

^ oils such as by crushing he seeds to produced a pulp 

m and then pressing the pulp to obtain oil. The pulp can 

fl also be extracted with appropriate solvents (e.g. 

fU 15 benzene) to obtain the oil. Industrial Chemistry: A 

ill 

2" Manual for the Student and Manufacturer, ed. By A. 

Q Rogers and A. B. Aubert, D. Van Nostrand Co., New York, 

Rj pages 547-548 (1912) . 

M= 20 C. Uses of Oil 

As discussed in the "Description of Related Art," 
phytostanols such as sitostanol are beneficial for 
lowering serum cholesterol (Ling et al . (1995) Life 
Sciences 57: 195-206) and preventing cardiac disease. 

25 Tocopherols act as antioxidants, and play a major role 
in protecting cells from damage caused by free radicals 
(Halliwell (1997) Nutrition Rev^iew 55: 44-60). As the 
amount of sitostanol in conventional vegetable and bran 
oils is low relative to that of other sterol compounds, 

30 the oils of the present invention are particularly 

useful for reducing the concentration of low density 
lipoprotein cholesterol in plasma. 

Thus, further aspects of the present invention 
include the following: 



118 MTC 6783.1 

PATENT 

Cholesterol -lowering compositions comprising the 
oils and sitostanol ester compositions disclosed 
herein. Such cholesterol -lowering compositions can 
take the form of, or be used in, foods, food products, 
processed foods, food ingredients, food additive 
compositions, or dietary supplements that contain oils 
and/or fats. Non-limiting examples include margarines; 
butters; shortenings; cooking oils; frying oils; 
dressings, such as salad dressings; spreads; 
mayonnaises; and vitamin/mineral supplements. Patent 
documents relating to such compositions include U.S. 
Patents 4,588,717 and 5,244,887, and PCT International 
Publication Nos . WO 96/38047, WO 97/42830, WO 98/06405, 
and WO 98/06714. Additional non-limiting examples 
include toppings; dairy products such as cheese and 
processed cheese; processed meat; pastas; sauces; 
cereals; desserts, including frozen and shelf -stable 
desserts; dips; chips; baked goods; pastries; cookies; 
snack bars; confections; chocolates; beverages; 
unextracted seed; and unextracted seed that has been 
ground, cracked, milled, rolled, extruded, pelleted, 
defatted, dehydrated, or otherwise processed, but which 
still contains the oils, etc., disclosed herein. 

Food additive compositions of the present 
invention can be made by a method comprising obtaining 
oil containing a phytostanol or phytostanol ester 
selected from sitostanol, at least one sitostanol 
ester, or mixtures thereof, from cultured cells, or 
seeds of a plant, of the present invention, and evenly 
distributing the oil or desired phytostanol compound 
in finely divided form throughout the food product or 
food additive composition to which it is added by 
dissolution or by suspension in an emulsion. For 
example, the oil or phytostanol compound can be 
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dissolved in an edible solubilizing agent, or can be 
mixed with an edible solubilizing agent, an effective 
amount of a dispersant, and optionally, an effective 
amount of an antioxidant. Examples of useful edible 
solubilizing agents include, but are not limited to, 
monoglycerides , diglycerides , triglycerides , vegetable 
oils, tocopherols, alcohols, polyols, or mixtures 
thereof. Examples of useful antioxidants include, but 
are not limited to, tocopherols, such as -tocopherol, 
ascorbic acid, inexpensive synthetic antioxidants, and 
mixtures thereof. Effective carriers for preparing 
emulsions or suspensions include water, alcohols, 
polyols, other edible compounds in which the oil or 
phytostanol compound is soluble or insoluble, and 
mixtures thereof. Examples of useful dispersants 
include, but are not limited to, lecithin, other 
phospholipids, sodium lauryl sulfate, fatty acids, 
salts of fatty acids, fatty acid esters, other 
detergent -like molecules, and mixtures thereof. 
Alternatively, the food additive composition can be 
made by a method comprising obtaining oil containing at 
least one tocopherol, and a phytostanol or phytostanol 
ester selected from sitostanol, at least one sitostanol 
ester, and mixtures thereof, from cultured cells, or 
seed of a plant, of the present invention, and mixing 
the oil with an edible solubilizing agent and an 
effective amount of a dispersant. Again, the edible 
solubilizing agent can include, but is not limited to, 
monoglycerides , diglycerides , triglycerides , vegetable 
oils, tocopherols, alcohols, polyols, or mixtures 
thereof, and the dispersant can include, but is not 
limited to, lecithin, other phospholipids, sodium 
lauryl sulfate, fatty acids, salts of fatty acids. 
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fatty acid esters, other detergent -like molecules, and 
mixtures thereof . 

The cholesterol - lowering compositions can also 
take the form of pharmaceutical compositions comprising 
a cholesterol -lowering effective amount of the oils or 
sitostanol ester compositions disclosed herein, along 
with a pharmaceutically acceptable carrier, excipient, 
or diluent. These pharmaceutical compositions can be 
in the form of a liquid or a solid. Liquids can be 
solutions or suspensions; solids can be in the form of 
a powder, a granule, a pill, a tablet, a gel, or an 
extrudate. U.S. Patent 5,270,041 relates to sterol- 
containing pharmaceutical compositions . 

Any of the foregoing cholesterol -lowering 
compositions can be used alone or in combination in 
methods to lower the risk of developing an elevated 
plasma concentration of low density lipoprotein 
cholesterol, to lower the plasma concentration of low 
density lipoprotein cholesterol, or to treat or prevent 
an elevated plasma concentration of low density 
lipoprotein cholesterol. Such methods comprise orally 
administering to a human or animal subject an effective 
amount of cholesterol -lowering composition. What 
constitutes an effective amount of cholesterol -lowering 
composition can be determined empirically, and depends 
in part on a variety of factors, including the age, 
weight, sex, diet, general medical condition of the 
subject, and the severity of hypercholesterolemia. 
Subjects undergoing treatment with the cholesterol - 
lowering combinations disclosed herein can be monitored 
by routine measurement of serum cholesterol levels to 
determine the effectiveness of therapy. Continuous 
analysis of the data obtained in this way permits 
modification of the treatment regimen during therapy so 
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that optimal effective amounts of the cholesterol - 
lowering compositions of this invention are 
administered, and so that the duration of treatment can 
be determined as well. In this way, the treatment 
5 regimen/dosing schedule can be rationally modified over 
the course of treatment so as to achieve the lowest 
cholesterol -lowering effective amount of the present 
compositions which results in satisfactory anti- 
cholesterolemic effectiveness, and so that 

10 administration of these compositions is continued only 
so long as is necessary to successfully treat this 
condition. In general, an effective amount of a 
cholesterol -lowering composition of the present 
invention in the form of a phytostanol- or phytostanol 

15 ester-containing composition is in the range of from 
about 0.1 gm/day to about 4.5 gm/day. By way of 
example, a phytostanol ester composition, for example a 
sitostanol ester composition, can be administered in an 
amount in the range of from about 0.1 gm/day to about 

20 4.5 gm/day, preferably from about 1 gm/day to about 4.5 
gm/day, more preferably from about 2 gm/day to about 
4.5 gm/day. A phytostanol composition, for example a 
sitostanol composition, can be administered in an 
amount in the range of from about 0.1 gm/day to about 3 

25 gm/day, preferably from about 1 gm/day to about 3 

gm/day, more preferably from about 2 gm/day to about 3 
gm/day. 

The cholesterol-lowering compositions of the 
present invention can be administered daily to patients 
30 in accordance with a number of different regimens. 
Fundamentally, these compositions should be 
administered in a cholesterol -lowering effective amount 
for a period of time effective to exert their anti- 
hypercholesterolemic preventing, reducing, or reversing 
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action. Administration of the present cholesterol - 
lowering compositions should be continued until the 
hypercholesterolemic condition has been controlled or 
eliminated . 

Another method encompassed by the present 
invention is that of achieving or improving effective 
absorption of sitostanol into a host, comprising 
producing at least one sitostanol ester by any of the 
methods disclosed herein, and administering this 
sitostanol ester to a host, which can be a human or 
animal. The sitostanol ester can be administered by a 
route selected from oral route, parenteral route, or 
topical route. The dose, which can be administered 
daily, can be up to about 10 milligrams of the 
sitostanol ester per kilogram of body weight. U.S. 
Patent 5,202,045 relates to the use of stanol fatty 
acid esters to reduce serum cholesterol . 

Also envisioned are plants which in addition to 
having increased levels of phytosterols and 
phytostanols due to the presence of constructs 
comprising sequences encoding 3 -hydroxy- 3 - 
me thylglutaryl -Coenzyme A reductase and at least one 
other sterol synthesis pathway enzyme, have increase 
levels of tocopherol due to the presence of constructs 
comprising sequences allowing the overexpression of 
enzymes in the tocopherol biosynthetic pathway. 

Tocopherol levels vary in different plants, 
tissues, and developmental stages, indicating a highly 
regulated biosynthetic pathway. The production of 
homogentisic acid by p-hydroxyphenylpyruvate 
dioxygenase is likely to be a regulatory point for bulk 
flow through the pathway because of irreversible enzyme 
action and because homogentisic acid production is the 
first committed step in tocopherol biosynthesis (Norris 
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et al., 1995, Plant Cell 7: 2139-2149). The other key 
regulatory step in tocopherol biosynthesis is the 
availability of the phytylpyrophosphate pool. Feeding 
studies (Furuya et al . , 1987, Phytochem. , 26: 2741- 
5 2747) in saf flower callus culture demonstrated 1.8-fold 
and 18 -fold increases in tocopherol synthesis by 
feeding homogentisate and phytol, respectively. In 
meadow rescue leaf, vitamin E increases in the initial 
phase of foliar senescence when phytol is cleaved off 
10 from the chlorophylls and when free phytol is available 
(Peskier et al . , 1989, J. Plant Physiol. 135: 428-432). 
These reports suggest tight coupling of tocopherol 
biosynthesis to the availability of homogentisic acid 
1 and phytol . 

15 Transformation of plants with nucleic acid 

^ constructs that increase the biosynthetic activity of 

3 the tocopherol pathway can lead to increased production 

y of particular tocopherol isomers, for example, a- 

tocopherol, are known in the art and can be found, for 
M 20 example, in PCT International publication WO 00/61771 

which is incorporated herein by reference. Formation 

of a- tocopherol from other tocopherols occurs due to S- 
adenosylmethionine (SAM) -dependent methylases such as 
y-tocopherol methyl transferase. Overexpression of 

25 methyl transferases in combination with 3-hydroxy-3- 
methylglutaryl -Coenzyme A reductase and at least one 
other sterol synthesis pathway enzyme as described 
herein is also contemplated in the present methods. 
Thus, any of the DNAs encoding enzymes of the 

30 tocopherol biosynthetic pathway, discussed above, are 
useful in the present invention. Transformation of 
plants with an early tocopherol biosynthesis gene is 
sufficient to produce plants having an elevated level 
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of tocopherols. By "early tocopherol biosynthesis 
gene" is meant DNA encoding geranylgeranylpyrophosphate 
synthase , geranylgeranylpyrophosphate hydrogenase , 4 - 
hydroxyphenyl pyruvate di oxygenase, and phytyl /prenyl 
transferase. DNA encoding enzymes active in later 
steps of tocopherol biosynthesis ("secondary tocopherol 
biosynthesis genes") can be expressed to enhance carbon 
flux through the tocopherol pathway even further, and 
to produce specific tocopherol isomers. In this way, 
the tocopherol biosynthetic pathway can be modified to 
enhance production of any tocopherol compound of 

interest, such as a- tocopherol . As noted above, a 
variety of sources are available for the early 
tocopherol biosynthesis genes (and other tocopherol 
biosynthesis genes) , and a gene from any of these 
sources can be utilized. If co- suppression occurs when 
a plant gene native to the target host plant is used to 
increase expression of a particular enzyme, a coding 
sequence from another source can be used as an 
alternative . 

Preferred genes for introduction into plants to 
alter tocopherol quantity/quality include 3-deoxy-D- 
arabino-heptulosonate-7-P synthase (DAHP synthase) , 
shikimate kinase, either or both of the prephenate 
dehydrogenases , 1 -deoxy-d-xylulose 5 -phosphate 
synthetase (DXS) , 1 -deoxy-d-xylulose 5-phosphate 
reductoisomerase (DXR) , 4-diphosphocytidyl-2C-methyl-d- 
erythritol synthase (YgbP) , 4-diphosphocytidyl-2C- 
methyl-d-erythritol kinase (YchB) , 2C-methyl-d- 
erythritol 2 , 4 -cyclodiphosphate synthase (YgbB) , the 
gene product of GcpE, LytB (Altincicek et al . , 2001, J". 
Bacteriol., 183:2411-2416; Altincicek et al . , 2001, J". 
Inmiunol., 166:3655-3658; Campos et al . , 2001, FEBS 
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Lett., 488:170-173) , geranylgeranylpyrophosphate 
synthase (GGPPS) , geranylgeranylpyrophosphate 
hydrogenase (GGH) , phytyl/prenyl transferase (PPT) , 4- 
hydroxy-phenylpyruvate dioxygenase (HPPD) , 2 -methyl -6- 
phytylplastoquinol tocopherol methyltransf erase I 

(MTI) , tocopherol cyclase, y- tocopherol 
methyltransf erase (GMT) a plant sir 1736 gene (see 
Cyanobase http://www.kazusa.or.jp/cyanbase), a plant sir 1737 gene 

(see Cyanobase http://www.kazusa.orjp/cyanbase), an ATPT2 gene 

(Smith et al . , Plant J"., 11:83-92, 1977), and an AANTl 
gene (Saint Guily et al . , Plant Physiol., 100:1069- 
1071, 1992) . 

4 -hydroxy-phenylpyruvate dioxygenase and 
geranylgeranylpyrophosphate hydrogenase will increase 
the homogentisate and phytol pools, respectively. 
Enzymes that control fluxes through pathways are well 
known to be regulated in higher organisms such as 
plants . Therefore, 4 -hydroxyphenylpyruvate dioxygenase 
and geranylgeranylpyrophosphate hydrogenase genes of 
microbial origin which are not subject to regulation in 
plants, or those from higher organisms (plants, algae, 
fungi, etc.) that are deregulated, are especially 
attractive in this regard. A non-limiting example is 
the microbial enzyme 4 -amino-4 -deoxyprephenate 
dehydrogenase (TyrA from Er^vinia herhicola) which can 
replace prephenate aminotransferase, arogenate 
dehydrogenase and aminotransferase. Overexpression of 
enzymes such as 3 -deoxy-arabino-heptulosonate 7-P 

(DAHP) synthase, prephenate dehydrogenase, and 
shikimate kinase would lead to increases in the levels 
of homogentisate. DNA encoding any of the tocopherol 
biosynthetic enzymes discussed herein can be introduced 
alone or in various combinations to enhance tocopherol 
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quantity and/or alter tocopherol quality. .When 
introduction of multiple enzymes is desirable, 
preferred combinations include, but aire not limited to, 
4 -hydroxyphenylpyruvate dioxygenase (HPPD) plus 
geranylgeranylpyro -phosphate hydrogenase (GGH) , 
geranylgeranylpyrophosphate synthase (GGPP synthase ) 
plus geranylgeranylpyrophosphate hydrogenase (GGH) , 4- 
amino-4 -deoxyprephenate dehydrogenase (TyrA) plus 
phytylprenyltransf erase (PPT) , geranylgeranylpyro- 
phosphate hydrogenase (GGH) plus 
phytylprenyltransf erase (PPT) , 

geranylgeranylpyrophosphate synthase (GGPP synthase) 
plus phytylprenyltransf erase (PPT) , 2 -methyl- 6- 
phytylplastoquinol tocopherol methyl transferase I (MTI) 
plus phytylprenyltransf erase (PPT) , or 2 -methyl -6- 
phytylplastoquinol tocopherol methyltransf erase I 
(MTI) , phytylprenyltransf erase (PPT) , 4- 
hydroxyphenyl pyruvate dioxygenase (HPPD) and 
geranylgeranylpyrophosphate synthase (GGPP synthase) . 

Plants characterized by increase levels of sterol 
and tocopherol production can be produced by 
transforming plant cells or tissues genetically altered 
for increased sterol production by the methods 
described herein with additional nucleic acid 
constructs encoding tocopherol biosynthetic enzymes. 
Introduction of constructs encoding tocopherol pathway 
enzymes can be accomplished using standard methods in 
molecular biology such as those described herein or 
those described in PCT International Publication WO 
00/61771. Introduction of constructs encoding 3- 
hydroxy- 3 -me thylglutaryl -Coenzyme A reductase, at least 
one other sterol synthesis pathway enzyme, and at least 
one tocopherol synthesis pathway enzyme can be 
accomplished in a single transformation or in a series 
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of transformations. For example, and without 
limitation, plant cells transformed with constructs 
encoding 3 -hydroxy- 3 -methylglutaryl -Coenzyme A 
reductase, at least one other sterol synthesis pathway 
enzyme as described herein could be selected and then 
further transformed with additional constructs encoding 
one or more tocopherol synthesis pathway enzymes and in 
particular S-adenosylmethionine- dependent y- tocopherol 
methyltransf erase enzyme. Successfully transformed 
cells can then be selected and used to regenerate 
plants having increased levels of phytosterols and/or 
phytostanols as well as increased levels of tocopherol. 
Plants produced can then be "selfed", a technique well 
known in the art, to produce uniform populations of 
plants . 

Alternatively, plants characterized by increased 
levels of tocopherols and phytosterols and/or 
phytostanols can be produced by traditional plant 
breeding methods. For example, plants transformed with 
nucleic acid constructs encoding 3 -hydroxy- 3- 
methylglutaryl -Coenzyme A reductase and at least one 
other sterol synthesis pathway enzyme can be sexually 
crossed with high tocopherol plants. Any plant 
transformed to produce increased levels of tocopherols 

and in particular a- tocopherols can be used. Non- 
limiting examples include plants produced by the 
methods described above and in PCT International 
publication WO 00/61771. 

If desired, the plants produced can be selfed to 
produce homozygous, uniform populations of plants. 

Seed obtained from the transgenic, progeny, 
hybrid, etc., plants disclosed herein can be used in 
methods for obtaining oil containing phytosterols. 
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phytosterol esters, phytostanols , phytostanol esters, 
or mixtures thereof along with tocopherols employing 
extraction and processing procedures known in the art. 
Note, in this regard, Kochhar (1983) Prog. Lipid Res, 
5 22: 161-188. Alternatively, seeds with increased levels 
of tocopherols and phytosterols , phytosterol esters, 
phytostanols, phytostanol esters, or mixtures thereof; 
or fruits and vegetables with increased levels of 
tocopherols and phytosterols, phytosterol esters, 
10 phytostanols, phytostanol esters, or mixtures thereof, 
can be used directly. 

Tocopherols and phytosterols and/or phytostanols 
can then be obtained from deodorized distillates of oil 
seed extracts and in particular soybean oil extracts. 



Si 

ni 15 Such deodorized distillates are expected to contain 



increased levels of both tocopherols and phytosterols 
and/or phytostanol extracts. Oil extracts from plants 
and seed of the present invention are particularly 
valuable in that they allow the production of high 

20 sterol/tocopherol oils in a single process thus 

resulting in reduced purification costs, processing 
time and waste stream. Methods for the isolation of 
tocopherols and sterols from plant oils are well known 
in the art and can be found, for example, in U.S. 

25 Patent Nos . 4,454,329; 5,097,012; 5,594,437; and 
5, 981, 781 . 



EXAMPLES 

The following examples are intended to provide 
30 illustrations of the application of the present 

invention. The following examples are not intended to 
completely define or otherwise limit the scope of the 
invention. 
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Example 1 . Enhancement of Phytosterol content in 
seeds of transgenic plants by seed- specif ic 
overexpression of full-length HMG-CoA reductase (HMGR) 
In order to examine the ability of HMGR 
5 overexpression for increasing sterol compound levels in 
seeds, the following experiment was performed in 
Glycine max. A full-length HMGR gene from rubber 
genomic DNA was expressed in developing Glycine max 
seeds using the 7S promoter. This was achieved by 

10 excising the rubber HMGR gene from the plasmid pHEV15 
(Schaller et al . , (1995) Plant Physiol., 109: 761-770) 
using £;coRI . The 3.8 Kb fragment was inserted into the 
EcoRl site of pMON29920 (Fig. 3) such that the HMGR 
gene is flanked by the 7S promoter on the 5' end and 

15 the E9 3' terminator to create pMON43800 (Fig. 4). This 
was next digested with Sail and Notl to release a 7.7 
Kb fragment that was then blunt-ended at the Sal 1 end 
before ligating to pMON23616 (Fig. 5) that was first 
cut with Smal and Notl. This created the pMON43818 

20 binary vector that contains the rubber HMGR gene driven 
by 7S promoter and the NPTII gene selection marker 
driven by the NOS promoter and 3' NOS terminator. 
PMON43818 (Fig. 6) was used to transform Agrohacterium 
tumefaciens and transform Glycine max cotyledon 

25 explants as described in Example 2 . 

Seeds from 15 transgenic plants and one 
nontransgenic control plant were harvested at maturity. 
Sterol extraction and analysis on ten individual seeds 
per plant were performed as described in Example 2 . 

30 Results are presented in Table 2 . 
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Table 2 : Sterol profile of transgenic soybean 
plants expressing rubber HMGR gene driven by 7s 

5 promoter. Event 1: control, events 2-16:15 transgenic 
plants generated by 15 independent events using 
AgrroJbacteriuiT? mediated transformation. 

Total sterols increased by 3.2- and 3.9- fold in 
the best performing plants (transgenic events 3 and 4) . 

10 These two events also showed the highest increases of 
individual sterols. Campesterol increased by 2.7-fold, 
sitosterol by 3.4-fold, sitostanol by 3.2-fold and 
other sterols by 6.5-fold in event 3 while stigmasterol 
increased by 2. 3- fold in event 4. The other sterols, 

15 which account for the highest increase in total sterols 
were pathway intermediates that included squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusif oliol , 
isof ucosterol , and stigmasta-7-enol . These pathway 
intermediates normally form minor constituents in the 
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Sterol composition of seeds. However, in the transgenic 
seeds, probably due to increased carbon flux through 
the pathway, they accumulate in significant amounts. 
This suggests additional control points for sterol 
biosynthesis in plants such as squalene epoxidase, C-24 
sterol methyltransf erase , and C-14 obtusifoliol 
demethylase . 

Example 2 : Enhance phytosterol biosynthesis in 
seeds of transgenic soybean plants by seed- specif ic 
expression of catalytic domain of HMG-CoA Reductase 
(HMGR) alone and in combination with sterol methyl 
transferase II (SMTII) 

In another embodiment of the present 
invention, the levels of sterol compounds, including 
sitosterol, sitostanol, campesterol, stigmasterol and 
at least one ester for each of the sterol compounds and 
mixture there of, can be elevated in plant seeds by 
overexpression of catalytic domain of plant -HMG-CoA 
reductases. One can transform a plant of interest 
using an expression cassette or vector comprising DNA 
encoding a polypeptide exhibiting 3 -hydroxy- 3- 
methylglutaryl -Coenzyme A reductase (HMG-CoA reductase 
or HMGR) activity. HMGR cDNAs from rubber have been 
successfully used to increase plant sterol levels in 
plant tissues (Schallet et al . (1995) Plant Physiol. 
109: 761-770) . Full-length and truncated forms of HMGR 
CDNAs encoding full-length and catalytic domain of 
HMGR, respectively, from Arahidopsis have also been 
used to overproduce sterols in transgenic Arabidopsis 
plants (Gonzalez et al . (1997) Third Terpnet Meeting of 
the European Network on Plant Isoprenoids Abstracts , 
Abstract No. 33, page 33) . In the above examples 
however, the genes have not been specifically targeted 
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to increase sterol levels in seeds. Another approach 
to enhance the nutritionally beneficial 24 -ethyl 
sterols (sitosterol, sitostanol) and reduce the 
accumulation of 24 -methyl sterols (campesterol ) in 
seeds one can co-express two genes encoding the enzymes 
HMGR and sterol methyl transferse II (SMTII) , each 
under the control of seed- specif ic promoter. Here we 
present evidence for such approaches : sterol 
composition of transgenic soybean seeds haboring 
truncated form (catalytic domain of HMGR without 
linker) of Arahidopsis HMGRl is presented in Figure 11 
and Table 3. Sterol composition of transgenic soybean 
seeds haboring Arahidopsis HMGRl (catalytic domain of 
HMGR without linker) and Arabidopsis SMTII is presented 
in Figure 12 and Table 4 . 

In order to examine whether overexpression of 
the catalytic domain of HMGR increases sterol levels in 
the seeds of transgenic soybean, the following 
experiment was performed in Glycine max. A truncated 
form of HMGRl cDNA encoding only the catalytic domain 
of HMGR from Airahidopsis was expressed in developing 
seeds of Glycine max using the seed-specific 78 
promoter. This was achieved by excising the cDNA 
fragment (HMGRlcd) encoding the HMGRl catalytic domain 
from the plasmid pHMGRlcd (Dale et al . , (1995) Eur. J. 
Biochem, 233: 506-513) using Ndel and Smal enzymes 
resulting in the isoloation of a 1 . 9 Kb fragment. The 
Ndel overhang was filled-in and the 1.9 Kb fragment 
was blunt-end ligated to vector pMON43818 (Figure 6) , 
previously Xhol (Xhol overhang was filled-in) and Smal 
digested such that the HMGRlcd was flanked by the IS 
promoter on the 5' end and the E9 2' terminator to 
create a recombinant vector pMON43 052 (Figure 7) . This 
was next digested with Xbal and blunt -ended and then 
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digested with NotI to release a 3.4 Kb fragment and 
ligated to pMON51850 (Figure 8) that was digeted with 
Smal and Notl. The ligation created a recombinant 
binary vector pMON43057 (Figure 9) that contained the 
cDNA fragment encoding the catalytic domain of 
Arabidopsis HMGRl , driven by 7S promoter and E9 3' 
terminator and the NPTII selectable marker gene driven 
by the NOS promoter and 3' NOS terminator. The 
pMON43057 was used for AgrobactGrium tumefaciens 
mediated transformation of Glycine wax cotyledon 
explants. The pMON43058 (Figure 10) construct carrying 
both the catalytic domain of Arabidopsis HMGRl and 
Arabidopsis SMTII , both driven by the 7S promoter, was 
also used for AgrroJbacteriujn teiTief aciens-mediated 
transformation of Glycine max in a similar manner 
descibed below. 

Explants for transformation were prepared as 
follows: sterilized seeds were germinated on 
germination medium under light at 2 8°C for 5-6 days. 
Germinated seeds were placed in the dark at 4°C for 24 
hours prior to excision. Seed coats were removed and 
hypocotyls of each seedling trimmed to a length of 0.5 
cm to 1.0 cm in length. The cotyledons were then split 
open such that the hypocotyl was split down in the 
middle. The primary leaves and apical region of each 
cotyledon was removed to expose the wounding region. 
Wounding was performed with 3-7 shallow, scalpel scores 
in line with the embryo axis, ensuring that the apical 
bud was damaged. Wounded explants were incubated in 
the culture of Agrrobacterium tujnef aci ens containing 
pMON43057. Incubation was for 1 hour at room 
temperature. Innoculated explants were then 
transferred to a co-culture medium and placed under 
light at 2 3°C for 3-4 days. At this time explants were 
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transferred to shooting medium without kanamycin 
selection and placed in a 2 5°C light growth room for 4 
days . 

After 4 days on delay, explants were transferred 
to a 186 ppm kanamycin selection medium and placed in a 
2 5°C light growth room for 2 weeks. At the end of two 
weeks explants were transferred to 186 ppm Woody Plant 
medium and placed again in a 25°C light growth room for 
another 2 weeks. Cultures were transferred every 2 
weeks to fresh medium for approximately 18-21 weeks. 
At the 6 week transfer, the cotyledons and any dead 
material were removed from the explants, and the 
petiole was cut. At each subsequent 2 week transfer, 
the petiole was cut to expose fresh cells to the 
medium. 

Transgenic shoots that were approximately in 
length, with 2 nodes, 1 open trifoliate and an active 
growing point were selected, cut and transferred to 
rooting medium. Once a good root system was developed, 
the plants were sent to the greenhouse to grow up in 
soils in pots. 

Seeds from the 15 transgenic plants and one 
nontransgenic control plant were harvested at maturity. 
Ten individual seeds from each plant were weighed and 
ground into fine powder using an electric grinder. A 

known amount of cholestane (usually 100|j,g in 100)al 
ethanol) was added to each approximately 5 0mg powder 
sample. Sterol compounds were hydrolyzed directly from 
the ground tissue by saponification with 2ml of 10% KOH 
in methanol by refluxing the material at 60°C for 30 
minutes. The refluxed samples were cooled to room 
temperature and filtered through glass wool . An equal 
volume of water was added to each filtrate, and the 
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nonsaponif iables were extracted by partitioning three 
times with equal volumes of hexane . The hexane phases 
were pooled and evaporated. The residues were 
resuspended in 1 ml of acetone, and quantatively 
transferred to glass GC vials that were immediately 
capped. Sterols were analyzed by Gas Chromatography- 
Flame Ionizing Detector using the following conditions: 
Inlet temperature of 220°C, detector temperature of 
32 0°C, and column oven temperature programmed from 22 0°C 
to 320°C with initial temperature for 1 minute and final 
temperature for 16 minutes and ramp rate of 8°/min. The 
column used was a glass capillary DB-5 column of 50 m 

length, 320 ^m diameter, and a film thickness of 0.25 

|im. The carrier gas was helium at a flow rate of 
l.Oml/min. Results are presented in Table 3 and Table 
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To fully characterize the sterol compounds present 
in the transgenic seeds, a representative sample was 
also analyzed by Gas Chromatograpy-Mass Spectrometry 
(GC-MS) for confirmation of the sterol compounds 
present. The GC-MS conditions were as follows: inlet 
temp. 2 50°C, detector 3 2 0°C, oven programmed from 18 0°C 
to 325°C with initial equilibration time of 1.0 min, 
ramping to 310°C at 4°/min at then 2d°/min to 325°C. The 
column was a DB-5 capillary glass column similar to the 
one used for GC-FID. 

Majority of the transgenic lines harboring 
pMON43057 showed 3 to 5-fold increase in total sterols. 
The best performing transgenic lines, GM_A13342 and 
GM_A13 634, showed 6.5- and 6.1- fold increase in total 
sterols, respectively. These lines showed 2- to 2.6- 
fold increase in sitosterol, 1.5 to 2. 2 -fold increase 
in sitostanol and no significant change in the 
campesterol levels. Hence the major proportion of the 
total sterol increase was accounted by the accumulation 
of pathway intermediates which include squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusif oliol , 
isof ucosterol , and stigmasta-7-enol . The best 
performing transgenic lines, GM_A13342 and GM_A13 634, 
showed 32.6- and 32. 2 -fold increase in pathway 
intermediates accumulation, respectively, as compared 
to the control. In all the transgenic lines harboring 
the pMON43057, 50-70% of the total increase was 
accounted by the increase in the pathway intermediates 
accumulation as compared to the control . The pathway 
intermediates include squalene, cycloartenol, 24- 
methylene cycloartenol , obtusif oliol , isof ucosterol , 
and stigmasta-7-enol . 

Six transgenic lines haboring pMON43058 produced 
5-8- to 6-fold increase in total sterols and the rest 
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of the 10 transgenic lines with the pMON43058 showed 3- 
to 5-fold increase in total sterols. The best 
performing transgenic lines showed about 2- to 3 -fold 
increase in sitosterol and 4.5- to 6-fold increase in 
sitostanol levels. However, the campesterol 
accumulation was reduced by 50% in these lines. This 
was due to overexpression of the Arahidopsis SMTII 
enzyme which enhances the carbon flux towards the 
synthesis of 24 -ethyl sterols thereby reducing the 
carbon flux through the pathway leading to the 
synthesis of 24 -methyl sterols. As seen in pMON43 057 
transgenic lines, all of the transgenic lines harboring 
the pMON43058 also accumulated 50-60% of the total 
sterols in the form of pathway intermediates which are 
squalene, cycloartenol , 24 -methylene cycloartenol , 
obtusif oliol , isof ucosterol , and stigmasta-7-enol . 
These pathway intermediates normally form minor 
constituents in the sterol composition of seeds. 
However, in the transgenic seeds, probably due to 
increased carbon flux through the pathway, they 
accumulate in significant amounts. The pathway 
intermediates accumulation is highly significant when 
the truncated from of HMGR is overexpressed as compared 
to the full length form of HMGR suggesting that the 
overexpression of the truncated form of HMGR creates 
even greater increase in carbon flux through the 
pathway. This provides further evidence for additional 
control points for sterol biosynthesis in plants such 
as squalene epoxidase, sterol methyltransf erase I, 
sterol C4 -demethylase , obtusif oliol C14a-demethylase , 
sterol C5-desaturase , and sterol methyl transferase II. 
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Example 3 . Enhancement of phytosterol biosynthesis 
in seeds of Arabidopsis transgenic plants by 
constitutive expression of different forms of 
Arabidopsis and rubber HMGR enzymes. 

Arabidopsis transgenic plants were generated using 
Agrobacterium mediated transformation of constructs 
(pMON53733, pMON53734, pMON53735, pMON53736, pMON53737, 
pMON53738, pMON53739, pMON53740) carrying cDNA encoding 
different forms of Arabidopsis and rubber HMGR enzymes 
driven by CaMV enhanced 3 5S promoter (Figures 13-20) . 
The transformed Arabidopsis seeds carrying each of the 
above constructs were selected on kanamycin (50|ag/ml) 
medium to select for transf ormants expressing the 
selectable marker, the NPTII gene driven by CaMV 3 5S 
promoter. Kanamycin resistant .Arabidopsis transgenic 
plants were grown in green house for maturity and seeds 
were collected from each of the transgenic lines for 
sterol analysis. About 50 mg of seeds from each 
transgenic line were weighed, homogenized and used for 
saponification to extract sterols as described in 
Example 2 . 

Figures 21-26 describe the sterol analysis data 
obtained from the transgenic lines carrying each of the 
above constructs. Figure 27 shows the effect on 
different sterol end products and pathway intermediate 
accumulation when different forms of rubber HMGR cDNAs 
were expressed const itutively in transgenic Arabidopsis 
plants. When truncated rubber HMGR (with or without 
linker region) was overexpressed the total sterol 
accumulation in seeds increased by 2.9 to 3.7-fold as 
compared with the wild type control plants. The sterol 
end products such as campesterol and sitosterol showed 
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1.5 to 2 -fold increase in the lines expressing 
truncated form of rubber HMGR (with and without 
linker) . However the sitostanol end product 
accumulation in the transgenic lines harboring the 
truncated form of rubber HMGR (with and without linker) 
was enhanced by 2.8 to 7-fold. There is a significant 
accumulation of pathway intermediates such as 
cycloartenol and 24 -methylene cycloartenol in the seeds 
of the transgenic lines transformed with the truncated 
form of rubber HMGR (with and without linker region) . 
The wild type control plants used in the experiment do 
not accumulate both of the pathway intermediates. 

I 

Example 4 , Comparison of Steroid Compounds from 
HMGR Constructs in a Yeast HMGRl Knockout Mutant 

The effects on the sterol levels of the expression 
of various HMGR constructs expressed in a yeast HMGRl 
knockout mutant were compared. Constructs containing a 
nucleic acid encoding the full length HMGR polypeptides 
from Arabidopsis and rubber were compared to those 
encoding a truncated Arahidopsis or rubber HMGR 
polypeptide that were lacking both the membrane binding 
and linker region of HMGR. The control yeast cells were 
transformed with a similar construct lacking a 
polypeptide encoding any form of HMGR. 

Yeast cells transformed with Arahidopsis HMGR and 
rubber HMGR constructs accumulated approximately the 
same amounts of zymosterol and ergosterol , but more 
squalene than the control yeast. 

Transformed yeast cells having rubber HMGR 
constructs accumulated about the same amount of 
ergosterol, but about twice as much squalene and 
zymosterol than the control yeast . 
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Transformed yeast cells having Arabidopsis tHMGR 
constructs accumulated three times as much squalene, 
twice as much zymosterol, and about 30 percent more 
ergosterol than the control yeast . 

Transformed yeast cells, having rubber tHMGR 
constructs accumulated three times as much squalene, 
four times as much zymosterol, and about 50 percent 
more ergosterol than the control yeast . 

The data are shown in a Figure 28, "Plant HMGRl 
Contructs in Yeast HMGRl Knockout Mutant". 

Example 5. Gene sequences for all genes listed in 
the application 

The sequences obtained from the NCBI public 
database are SEQ ID NO.: 1,2,3,20,21,22,23. These 
sequences are included in the appendix and denoted as 
follows : 
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SEQ ID 1 = Arabidopsis squalene epoxidase protein 
sequence (Accession NO: AC004 786) See Appendix A 

SEQ ID 2 = Arabidopsis squalene epoxidase 
(Accession NO: N64 916) See Appendix B 

SEQ ID 3 = Arabidopsis squalene epoxidase 
(Accession NO: T44 667) See Appendix C 

SEQ ID 4 = Arabidopsis squalene epoxidase (clone 
ID: ATA506263) nucleotide sequence 



143 MTC 6783.1 

PATENT 

GAATTCCCGGGTCGACCCACGCGTCCGCTTATAGATAAGGATATGGCCTT 
TACGAACGTTTGCCTATGGACGCTACTCGCCTTCATGCTGACTTGGACAGTGTTC 
TACGTCACAAACAGGGGGAAGAAGGCGACGCAGTTGGCGGATGCGGTGGTTGAAG 
AGCGAGAAGACGGTGCTACTGACGTTATCATCGTTGGGGCTGGAGTAGGCGGCTC 
5 GGCTCTCGCATATGCTCTTGCTAAGGACGGGCGTCGAGTCCATGTAATAGAGAGG 
GACCTGAGAGAACCAGAGAGAATCATGGGTGAGTTTATGCAACCAGGAGGACGAC 
TCATGCTCTCTAAGCTTGGTCTTGT^GATTGTTTGGAGGGAATAGATGCCCAAAA 
AGCCACGGGCATGACAGTTTATAAGGACGGAAAAGAAGCAGTCGCATCTTTTCCC 
GTGGACAACAACAATTTTCCTTTTGATCCTTCGGCTCGATCTTTTCACAATGGCC 

10 GATTCGTCCAACGATTGCGGCAAAAGGCTTCTTCTCTTCCCAATGTGCGCCTGGA 
AGAAGGAACGGTGAAGTCTTTGATAGAAGAAAAAGGAGTGATCAAAGGAGTGACA 
TACAAAAATAGCGCAGGCGAAGAAACAACAGCCTTGGCACCTCTCACTGTAGTAT 
GCGACGGTTGCTACTCAAACCTTCGCCGGTCTCTTAATGACAACAATGCGGAGGT 
TCTGTCATACCAAGTTGGTTTTATCTCAAAGAACTGTCAGCTTGAAGAACCCGAA 

15 AAGTTAAAGTTGATAATGTCTAAACCCTCCTTCACCATGTTGTATCAAATCAGCA 
GCACCGACGTTCGTTGTGTTTTTGAAGTTCTCCCCAACAACATTCCTTCTATTTC 
AAATGGTGAAATGGCTACTTTCGTGAAGAACACTATTGCTCCTCAGGTACCTTTA 
AAACTCCGCAAAATATTTTTGAAAGGGATTGATGAAGGAGAACATATAAAAGCCA 
TGCCAACAAAGAAGATGACAGCTACTTTGAGCGAGAAGAAAGGAGTGATTTTATT 

20 GGGAGATGCATTCAACATGCGTCATCCAGCAATCGCATCTGGAATGATGGTTTTA 
TTATCTGACATTCTCATTTTACGCCGTCTTCTCCAGCCATTAAGCAACCTTGGCA 
ATGCGCAAAAAATCTCACAAGTTATCAAGTCCTTTTATGATATCCGCAAGCCAAT 
GTCAGCGACAGTTAACACGTTAGGAAATGCATTCTCTCAAGTGCTAGTTGCATCG 
ACGGACGAAGCAAAAGAGGCAATGAGACAAGGTTGCTATGATTACCTCTCTAGTG 

25 GTGGGTTTCGCACGTCAGGGATGATGGCTTTGCTAGGCGGCATGAACCCTCGTCC 
GATCTCTCTCATCTATCATCTATGTGCTATCACTCTATCCTCCATTGGCCATCTA 
CTCTCTCCATTTCCCTCTCCCCTTGGCATTTGGCATAGCCTTCGACTTTTTGGTT 
TGGCTATGAAAATGTTGGTTCCCCATCTCAAGGCTGAAGGAGTTAGCCAAATGTT 
GTTTCCAGTCAACGCCGCCGCGTATAGCAAAAGCTATATGGCTGCAACGGCTCTT 

30 TAAAACACTGGTGCTTTAAACTGCAAAATATAACACATATATAAATCCCGAATCT 
TTGTGATTCTGCATATATTGTGTTCTACAATTATTCTCATATAAATGAAAATTGT 
TCTACGTAAAAGTAAAAAGAAGGAATTGTAATACTAATAAAACGAGTTTTTAATT 
CTGTTGAATGCTTGTGTATATTGGTGAAAAAAAAAAAAAT^AAAAAAAAAAAAAAA 
AAAAAAAAAGGGCGGCCGC 

35 

SEQ ID 5 = Arabidopsis squalene epoxidase (clone 
ID: ATA506263) amino acid translation 

EFPGRPTRPLIDKDMAFTNVCLWTLLAFMLTWTVFYVTNRGKKATQLADA 
40 WEEREDGATDVIIVGAGVGGSALAYALAKDGRRVHVIERDLREPERIMGEFMQP 
GGRLMLSKLGLEDCLEGIDAQKATGMTVYKDGKEAVASFPVDNNNFPFDPSARSF 
HNGRFVQRLRQKASSLPNVRLEEGTVKSLIEEKGVIKGVTYKNSAGEETTALAPL 
TWCDGCYSNLRRSLNDNNAEVLSYQVGFISKNCQLEEPEKLKXIMSKPSFTMLY 
QISSTDVRCVFEVLPNNIPSISNGEMATFVKNTIAPQVPLKLRKIFLKGIDEGEH 
45 IKAMPTKJCMTATLSEKKGVILLGDAFNMRHPAIASGMIWLLSDILILRRLL^ 

NLGNAQKI SQVI KSFYDIRKPMSATVNTLGNAFSQVLVASTDEAKEAMRQGCYDY 
LSSGGFRTSGIVIMALLGGMNPRPISLIYHLCAITLSSIGHLLSPFPSPLGIWHSLR 
LFGLAMKMLVPHLKAEGVSQMLFPVNAAAYSKSYMAATAL* 
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GAATTCCCGGGTCGACCCACGCGTCCGCGGACGCGTGGGATTGAGAACAA 
ATAGATTTGGTTATATATGGCTTTTACGCACGTTTGTTTATGGACGTTAGTCGCC 
5 TTCGTGCTGACGTGGACGGTGTTCTACCTTACCAACATGAAGAAGAAGGCGACGG 
ATTTGGCTGATACGGTGGCTGAGGATCAAAAAGACGGTGCTGCTGACGTCATTAT 
CGTCGGGGCTGGTGTAGGTGGTTCGGCTCTCGCATATGCTCTTGCTAAGGATGGG 
CGTCGAGTACATGTGATCGAGAGGGACATGAGAGAACCAGAAAGAATGATGGGTG 
AGTTTATGCAACCTGGCGGACGACTCATGCTTTCTAAACTTGGCCTTCAAGATTG 
10 CTTGGAAGACATAGATGCACAGAAAGCCACGGGTTTGGCAGTTTATAAAGATGGA 
AAAGAAGCAGACGCACCTTTTCCAGTGGATAACAACAATTTTTCTTATGAACCTT 
CTGCTCGATCTTTTCACAATGGCCGATTCGTCCAACAACTGCGTCGAAAGGCTTT 
TTCTCTTTCCAATGTGCGCCTGGAAGAAGGAACGGTGAAGTCTTTACTAGAAGAA 
AAAGGAGTGGTCAAAGGAGTGACATACAAGAATAAAGAAGGCGAAGAAACAACAG 
15 CCTTGGCACCTCTCACTGTGGTATGCGACGGTTGCTACTCAAACCTTCGTCGGTC 
TCTTAATGATGACAACAATGCTGAGATTATGTCGTACATAGTTGGTTACATCTCA 
AAGAATTGTCGGCTTGAAGAACCCGAAAAGCTACACTTGATATTGTCTAAACCAT 
CTTTCACCATGGTATACCAAATAAGCAGCACTGACGTTCGTTGTGGTTTTGAGGT 
TCTCCCCGAAAATTTTCCTTCTATTGCAAATGGTGAAATGTCTACTTTCATGAAG 
20 AATACTATAGTTCCTCAGGTACCTCCAAAACTCCGCAAAATATTTTTGAAAGGTA 
TAGATGAGGGAGCACACATAAAAGTGGTGCCGGCAAAGCGCATGACATCTACTTT 
AAGCAAGAAGAAAGGTGTGATTGTATTGGGAGATGCATTCAATATGCGTCATCCA 
GTTGTTGCATCTGGAATGATGGTTTTACTGTCGGACATTCTCATTCTACGCCGTC 
TTCTTCAGCCATTAAGCAACCTCGGCGATGCAAACAAAGTCTCAGAAGTTATCAA 
25 TTCCTTTTATGATATCCGCAAGCCAATGTCGGCGACGGTTAACACATTGGGAAAT 
GCATTTTCTCAAGTACTAATTGGATCAACGGATGAAGCAAAAGAGGCAATGAGAC 
AGGGTGTCTATGATTACCTTTGTAGTGGCGGGTTTCGTACGTCAGGGATGATGGC 
n TCTGCTCGGCGGCATGAATCCTCGTCCTCTCTCTCTCGTCTATCATCTTTGTGCC 
B ATCACTCTATCCTCCATTGGCCAACTGCTCTCTCCATTTCCCTCTCCCCTTCGCA 
M 30 TTTGGCATAGCCTCAAGCTTTTTGGTTTGGCCATGAAAATGTTGGTTCCCAATCT 

CAAAGCTGAAGGAGTTAGCCAAATGTTGTTTCCAGCAAATGCAGCCGCGTATCAC 
AAAAGCTATATGGCTGCAACCACTCTCTAAACTTTGATGCTCTCAATCGCAATAT 
ATATGGAGCACGAATCTATGTGATTGTGCATTTGGTAAACGTGTATTGCAGTGCT 
TATAATTATTAGTATGTAACGGGGAAAAGTTCTAAACACAAAAAAATAAACTTTT 
35 GAATGTTATATGTGTGAATTATTTTTGTTGTTACAAGTAATGCTCTTTTTTTTTA 
GCTTCACACATGTATTATTGGAGCTAATTTTTTGTTTCTCTGTTCTTTTATTTTT 
GTTTTCTTACTGTATTTACTTTGAAAAGTTTCGTTTTATACATATTGGACATTTT 
TTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG 
40 GCGGCCGC 

SEQ ID 7 = Arabidopsis squalene epoxidase (clone 
ID: ATA304243) amino acid translation 

45 MAFTHVCLWTLVAFVLTWTVFYLTNMKJ<XATDLAD WAE I VGAGV 

GGSALAYAJ^KDGRRVHVIERDMREPERI^GEFMQPGGRLMLSKLGLQDCLEDID 
AQKATGIAVYKDGKEADAPFPVDNlSnsrFSYEPSARSFHNGRFVQQLRRKAFSLSW 
RLEEGTVKSLLEEKGWKGVTYKNKEGEETTALAPLTWCDGCYSNLRRSLNDDN 
NAEIMSYIVGYISKNCRLEEPEKLHLILSKPSFTMVYQISSTDVRCGFEVLPENF 

50 PSIANGEMSTFMKNTIVPQVPPKLRKIFLKGIDEGAHIKVVPAKRMTSTLSKKKG 
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VIVLGDAFNMRHPWASGMMVLLSDILILRRLLQPLSNLGDANKVSEVINSFYDI 
RKPMSATWTLGNAFSQVLIGSTDEAKEAMRQGWDYLCSGGFRTSGMM^ 
NPRPLSLVYHLCAITLSSIGQLLSPFPSPLRIWHSLKLFGIiAMKMLVPNLKAEGV 
SQ MLFPANAAAYHKSYMAATTL* 

5 

SEQ ID 8 = Arabidopsis squalene epoxidase (clone 
ID: ATA102071) nucleotide sequence 

AAATCATATTGAGAACAAATAGATTTGGTTATATATGGCTTTTACGCACG 
10 TTTGTTTATGGACGTTAGTCGCCTTCGTGCTGACGTGGACGGTGTTCTACCTTAC 
CAACATGAAGAAGAAGGCGACGGATTTGGCTGATACGGTGGCTGAGGATCAAAAA 
GACGGTGCTGCTGACGTCATTATCGTCGGGGCTGGTGTAGGTGGTTCGGCTCTCG 
CATATGCTCTGCTAAGTGTGCGCCTGGAAGAAGGAACGGTGAAGTCTTTACTAGA 
AGAAAAAGGAGTGGTCAAAGGAGTGACATACAAGAATAAAGAATGCGAACAAACA 
15 ACAGCCTTGGCACCTCTCACTGTGGTATGCGACGGTTGCTAATCAAACCTTCGTC 
Q GGTCTCTTAATG 

^ SEQ ID 9 = Arabidopsis squalene epoxidase (clone 

S ID: ATA102071) amino acid translation 

m 20 

Kl MAFTHVCLWTLVAFVLTWTVFYLTNMKKKATDIiADTVAEDQKDGAADVI I 

pd VGAGVGGSALAYALLSVRLEEGTVKSLLEEKGWKGVTYKNKECEQTTALAPLTV 

bj VCDGC 
Q 25 

y"' SEQ ID 10 = Arabidopsis squalene epoxidase (clone 

ID: ATA504158) nucleotide sequence 



t 



CACAAAGCAAAAAAATCTCTGTAAAAGCAGAACGATAATGGAGTCACAAT 
30 TATGGAATTGGATCTTACCTCTTTTGATCTCTTCTCTCCTCATCTCCTTCGTCGC 
TTTCTATGGATTCTTCGTCAAACCGAAGCGGAACGGTCTCCGTCACGATCGGAAA 
ACTGTTTCTACCGTCACCTCCGACGTCGGATCTGTTAATATTACCGGAGATACTG 
TCGCTGATGTCATTGTTGTTGGAGCTGGTGTTGCTGGTTCTGCTCTTGCTTATAC 
TCTTGGAAAGGGGAAATTTAAACGCCGAGTTCATGTGATTGAAAGAGATTTATCG 
35 GAGCCTGATCGTATTGTTGGGGAGTTGTTACAGCCTGNGGGTTACCTCAAGTTAC 
TGGAGTGTGGAATTGGAGATTGTGTGGAAGAAATAGATGCTCAGCNTGTGTATGG 
TTATGCACTTTTTAAAAATGGG 



SEQ ID 11 = Arabidopsis squalene epoxidase (clone 
40 ID: ATA504158) amino acid translation 

TKQKNLCKSRTIMESQLWNWILPLLISSLLISFVAFYGFFVKPKRNGLRH 
DRKTVSTVTSDVGSVNITGDTVADVIWGAGVAGSAIiAYTLGKGKFKRRVHVIER 
DLSEPDRIVGELLQPXGYLKLLECGIGDCVEEIDAQXVYGYAIjFKNG 



45 



SEQ ID 12 = Arabidopsis obtusifoliol C14a- 
demethylase nucleotide sequence (Accession NO: 
complement, join AC002329 : 37461...38456 , 
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AC002329 : 39121...39546) (homolog of sorghum obtusifoliol 
C14a-demethylase) nucleotide sequence 

CGTGTTTTACAAATTTCCTTTGTTGGTTTTGCACAGATTTAAAGAACCCT 
5 AACGAGAGAAAAAAATGGACTGGGATTACTATACGCTGTTGAAGACGAGTGTGGC 
TATTATTATAGTGTTTGTTGTGGCCAAACTCATAACCTCCTCCAAATCCAAGAAG 
AAAACAAGTGTCGTCCCACTCCCTCCAGTTCTTCAAGCGTGGCCTCCATTTATCG 
GATCCCTAATCCGCTTCATGAAAGGTCCAATAGTGCTACTTAGAGAGGAATATCC 
TAAGCTTGGAAGTGTTTTCACAGTGAAGCTTCTTCACAAAAACATCACTTTTCTC 
10 ATCGGTCCCGAAGTCTCGTCCCACTTTTTCAACGCTTATGAATCTGAACTCAGCC 
AGAAAGAAATTTACAAATTTAATGTGCCTACTTTTGGCCCCGGAGTTGTGTTTGA 
TGTTGACTATCCCGTTCGGATGGAGCAGTTCCGATTCTTCTCCAGCGCTCTCAAG 
GATTACTTCTCAAAATGGGGAGAAAGTGGGGAAGTGGATCTAAAGGCCGAGTTAG 
AGCGTCTAATCACCTTGACTGCTAGTAGATGTCTATTGGGTCGAGAAGTCCGTGA 
15 CCAACTTTTTGATGATGTTGCTCCATTGTTCCATGACCTTGATAAAGGCATGCAA 
CCCATAAGTGTCATCTTCCCAAAGCTCCCCATTCCAGCTCACAATTGTCGTGACC 
O GTGCTCGCGGAAAGATTGCAAAAATCTTTTCAAACATCATAGCAACAAGAAAACG 

CTCTGGTGACAAATCAGAGAACGACATGCTACAATGTTTCATCGACTCAAAGTAC 
AAAGACGGTAGAGAGACAACTGAATCTGAAGTAACTGGTTTGCTCATTGCTGGTT 
20 TGTTTGCAGGACAACATACAAGCTCTATCACTGCCACATGGACCGGTGCTTATCT 
AATTCAAAACAAACACTGGTGGTCCGCGGCTTTGGACGAGCAGAAGAAACTGATT 
GGAAAACATGGGGACAAGATCGACTACGATGTTTTGTCTGAGATGGATTTTCTGT 
TTCGCAGTGCAAAAGAAGCTTTAAGGCTTCACCCTCCAAAGATCTTACTGCTGAG 
AACAGTACACAGTGATTTCACCGTGACAACTCGAGAAGGAAAGCAATATGAGATA 
25 CCAAAGGGTCATATCGTTGCAACTTCTCCTGCATTCGCCAACCGCTTACCTCATG 
TCTACAAAGATCCGGAAAATTTTGATCCGGATAGATTTTCAAAGGAAAGAGAAGA 
GGATAAAGCAGCTGGTTCGTGTTCATACATCTCTTTGGGAGCTGGTAGGCACGAG 
TGTCCTGGTGGATCATTTGCGTTCTTGCAGATCAAAGCCGTATGGTGTCACTTAT 
TGAGAAACTTTGAGCTTGAGTTAGTGTCACCTTTCCCTGAAATCAACTGGAATGC 
30 TTTGGTCGTTGGTGCTAAAGGAAATGTCATGGTTCGTTACAAGCGTCGTCCCTTT 
TCTTAA 
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SEQ ID 13 = Arabidopsis obtusifoliol C14a- 
35 demethylase nucleotide sequence (Accession NO: 
complement, join AC002329 : 37461...38456 , 

AC002329 : 39121...39546) (homolog of sorghum obtusifoliol 
C14a-demethylase) amino acid translation 

40 MDWDYYTLLKTSVAI 1 1 VFWAKLITSSKSKKKTSWPLPPVLQAWPPFI 

GSLIRFMKGPIVLLREEYPKIiGSVFTVKLLHKNITFLIGPEVSSHFFNAYESELS 
QKEIYKFNVPTFGPGWFDVDYPVRMEQFRFFSSALKDYFSKWGESGEVDLKAEL 
ERLI TLTASRCLLGREVRDQLFDDVAPLFHDLDKGMQP I S VI FPKLP I PAHNCRD 
RARGKIAKIFSNIIATRKRSGDKSENDMLQCFIDSKYKDGRETTESEVTGLLIAG 

45 LFAGQHTSSITATWTGAYLIQNKHWWSAALDEQKKLIGKHGDKIDYDVLSEMDFL 
FRSAKEALRLHPPKILLLRTVHSDFTVTTREGKQYEIPKGHIVATSPAFANRLPH 
VYKDPENFDPDRFSKEREEDKAAGSCSYISLGAGRHECPGGSFAFLQIKAWCHL 
LRNFELEL VS P F PE I NWNAL WGAKGNVMVR YKRRPF S * 



50 



SEQ ID 14 = Arabidopsis obtusifoliol C14a- 
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demethylase (clone ID: ATA101105) nucleotide sequence 



GACACTATAGAAGAGCTATGACGTCGCATGCACGCGTACGTAAGCTCGGA 
ATTCGGCTCGAGCTTGTTCACAAAAAGATTACTTTTCTTATTGGTCCTGAAGTCT 
5 CTGCTCATTTTTTCAAAGCTTCTGAATCTGATCTTAGTCAGCAGGAAGTGTATCA 
GTTCAATGTCCCTACTTTTGGTCCTGGAGTTGTTTTCGATGTTGATTATTCTGTT 
TCGTCAGGAGCAGTTCGGTTCTTCACTGAGGCACTTAGAGTTAACAAGTTGAAGG 
GTTATGTGGATATGATGGTTACTGAAGCTGAGGATTACTTCTCTAAATGGGGAGA 
GAGTGGTGAAGTTGATATTAAGGTTGAGCTAGAGAGGCTCATCATCTTGACTGCA 
10 AGTGATGTTTACTGGGTCGAGAAGTTCGTGATCAGCTTTTTGATGATGTCTCTGC 
TTTGTTCCATGACCTTGACAATGGAATGCTTCCCATCAGTGCTTCCCATCAGTGT 
TCTCTTCCCATATCTCCCAATTCCAGCTCACCG 

SEQ ID 15 = Arabidopsis obtusifoliol C14a- 
15 demethylase (clone ID: ATA101105) amino acid translation 

HYRRAMTSHARVRKLGIRLELVHKKITFLIGPEVSAHFFKASESDLSQQE 
VYQFNVPTFGPGWFDVDYSVRQEQFGSSLRHLELTS 



in 20 SEQ ID 16 = Arabidopsis obtusifoliol C14a- 



e 1^ * 

•a 
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demethylase (clone ID: ATA202967) nucleotide sequence 



ii . 

fU 

PI 



_ TCGACCCCGCGTCCGCGGACGCGTGGGATCAGCTTCAAGCTTAAGAGAGC 
E TTCGAAAGCGAAAGCGACGATTTCTTCTCCATCGTGAGAGCAAATCTCCAGAGCC 
O 25 GTTTTCTCTTCTTCTTCTTCCTCCTCGCGCCGTCTCTGAAACTCCATCATCGTAT 

CAATCAAATTGCTTCCTCCTCCAAATTGAAAAACAATGGAATTGGATTCGGAGAA 
CAAATTGTTGAAGACGGGTTTGGTTATAGTGGCGACACTTGTTATAGCCAAACTC 
ATCTTCTCTTTCTTCACTTCTGATTCTAAGAAGAAGCGTCTTCCTCCTACTCTTA 
AAGCTTGGCCTCCATTGGTTGGAAGTCTTATCAAATTCTTGAAAGGACCTATTAT 
30 TATGCTTAGAGAGGAATACCCTAAGCTTGGAAGTGTGTTTACTGTTAATCTTGTT 
CACAAAAAGATTACTTTTCTTATTGGTCCTGAAGTCTCTGCTCATTTTTTCAAAG 
CTTCTGAATCTGATCTTAGTCAGCAGGAAGTGTATCAGTTCAATGTCCCTACTTT 
TGGTCCTGGAGTTGTTTTCGATGTTGATTATTCTGTTCGTCAGGAGCAGTTTCGG 
TTCTTCACTGAGGCACTTAGAGTTAACAAGTTGAAGGGTTATGTGGATATGATGG 
35 TTACTGAAGCTGAGGATTACTTCTCTAAATGGGGAGAGAGTGGTGAAGTTGATAT 
TAAGGTTGAGCTAGAGAGGCTCATCATCTTGACTGCAAGTAGATGTTTACTGGGT 
CGAGAAGTTCGTGATCAGCTTTTTGATGATGTCTCTGCTTTGTTCCATGACCTTG 
ACAATGGAATGCTTCCCATCAGTGTTCTCTTCCCATATCTCCCAATTCCAGCTCA 
CCGCCGTCGTGACCGTGCCCGAGAAAAGCTTTCGGAGATTTTCGCAAAAATCATT 
40 GGGTCGAGAAAACGCTCTGGAAAAACAGAGAACGACATGCTGCAGTGTTTCATCG 
AATCAAAGTACAAAGATGGTAGACAGACAACCGAATCTGAAGTCACTGGTTTGCT 
CATTGCTGCTCTGTtTGCAGGACAACACACGAGCTCTATCACTTCCACCTGGACC 
GGTGCTTATCTGATGCGATACAAAGAGTACTTCTCAGCTGCTCTTGATGAGCAGA 
AGAACCTGATTGCGAAACATGGAGACAAGATCGATCATGATATCTTATCCGAGAT 
45 GGATGTTCTCTACCGCTGCATTAAGGAAGCGTTGAGGCTTCACCCTCCACTCATC 
ATGTTAATGAGAGCCTCGCACAGTGATTTCAGCGTGACAGCTCGGGATGGAAAAA 
CTTACGATATCCCAAAGGGTCACATCGTTGCAACCTCCCCTGCATTTGCCAACCG 
CTTACCGCACATCTTCAAAGACCCCGACACCTACGACCCAGAAAGATTCTCCCCT 
GGAAGAGAAGAGGACAAAGCCGCAGGGGCATTCTCGTACATTGCATTCGGAGGGG 
50 GAAGGCACGGGTGCCTTGGAGAGCCGTTTGCTTACCTGCAGATCAAAGCCATATG 
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GAGTCATTTGTTGAGGAACTTCGAGCTTGAGCTAGTTTCACCGTTCCCTGAGATT 
GACTGGAACGCTATGGTGGTTGGAGTTAAAGGCAATGTGATGGTGCGTTACAAGA 
GGCgcCAGCTTTCTTAAAGACAAGTTTAAGGTTATTGCAGCTTTGGATTTTTCTC 
TCTGGTTTCTGCTTTGCTTTTGTCCCTCTCTGGTTTTAGTTTTGTTGTTGAATAA 
TTCTTCTGTTTTTATAAACTGTTGTTACTCTTTAATTGACATTTATTTTTAAGCT 
TCCTAAGTTTGTGGTTCAAAAAAAAAAAAAGGCGGCGTTACT 

SEQ ID 17 = Arabidopsis obtusifoliol C14a- 
demethylase (clone ID: ATA202967) amino acid translation 



10 
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MELDSENKLLKTGLVIVATLVIAKLIFSFFTSDSKKKRLPPTLKAWPPLVGSLIK 
FLKGPIIMLREEYPKLGSVFTVNLVHKKITFLIGPEVSAHFFKASESDLSQQEVY 
QFNVPTFGPGWFDVDYSWQEQFRFFTEALRVNKLKGYVDMMVTEAEDYFSKWG 
ESGEVDIKVELERLIILTASRCLLGREVRDQLFDDVSALFHDLDNGMLPISVLFP 
15 YLP I PAHRRRDRAREKLSE I FAKI I GSRKRSGKTENDMLQCF I ES KYKDGRQTTE 
SEVTGLLIAALFAGQHTSSITSTWTGAYLMRYKEYFSAALDEQKNLIAKHGDKID 
HDILSEMDVLYRCIKEALRLHPPLIMLMRASHSDFSVTARDGKTYDIPKGHIVAT 
SPAFANRLPHIFKDPDTYDPERFSPGREEDKAAGAFSYIAFGGGRHGCLGEPFAY 
LQI KAI WSHLLRNFELELVS PF PE I DWNAMWGVKGNVMVRYKRRQLS * 

20 
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SEQ ID 18 = Arabidopsis obtusifoliol C14a- 
fU demethylase (clone ID: ATA403931) nucleotide sequence 

£ TCGACCCCGCGTCCGCGGACGCGTGGGATCAGCTTCAAGCTTAAGAGAGC 
O 25 TTCGAAAGCGAAAGCGACGATTTCTTCTCCATCGTGAGAGCAAATCTCCAGAGCC 

GTTTTCTCTTCTTCTTCTTCCTCCTCGCGCCGTCTCTGAAACTCCATCATCGTAT 
CAATCAAATTGCTTCCTCCTCCAAATTGAAAAACAATGGAATTGGATTCGGAGAA 
CAAATTGTTGAAGACGGGTTTGGTTATAGTGGCGACACTTGTTATAGCCAAACTC 
ATCTTCTCTTTCTTCACTTCTGATTCTAAGAAGAAGCGTCTTCCTCCTACTCTTA 
30 AAGCTTGGCCTCCATTGGTTGGAAGTCTTATCAAATTCTTGAAAGGACCTATTAT 
TATGCTTAGAGAGGAATACCCTAAGCTTGGAAGTGTGTTTACTGTTAATCTTGTT 
CACAAAAAGATTACTTTTCTTATTGGTCCTGAAGTCTCTGCTCATTTTTTCAAAG 
CTTCTGAATCTGATCTTAGTCAGCAGGAAGTGTATCAGTTCAATGTCCCTACTTT 
TGGTCCTGGAGTTGTTTTCGATGTTGATTATTCTGTTCGTCAGGAGCAGTTTCGG 
35 TTCTTCACTGAGGCACTTAGAGTTAACAAGTTGAAGGGTTATGTGGATATGATGG 
TTACTGAAGCTGAGGATTACTTCTCTAAATGGGGAGAGAGTGGTGAAGTTGATAT 
TAAGGTTGAGCTAGAGAGGCTCATCATCTTGACTGCAAGTAGATGTTTACTGGGT 
CGAGAAGTTCGTGATCAGCTTTTTGATGATGTCTCTGCTTTGTTCCATGACCTTG 
ACAATGGAATGCTTCCCATCAGTGTTCTCTTCCCATATCTCCCAATTCCAGCTCA 
40 CCGCCGTCGTGACCGTGCCCGAGAAAAGCTTTCGGAGATTTTCGCAAAAATCATT 
GGGTCGAGAAAACGCTCTGGAAAAACAGAGAACGACATGCTGCAGTGTTTCATCG 
AATCAAAGTACAAAGATGGTAGACAGACAACCGAATCTGAAGTCACTGGTTTGCT 
CATTGCTGCTCTGTtTGCAGGACAACACACGAGCTCTATCACTTCCACCTGGACC 
GGTGCTTATCTGATGCGATACAAAGAGTACTTCTCAGCTGCTCTTGATGAGCAGA 
45 AGAACCTGATTGCGAAACATGGAGACAAGATCGATCATGATATCTTATCCGAGAT 
GGATGTTCTCTACCGCTGCATTAAGGAAGCGTTGAGGCTTCACCCTCCACTCATC 
ATGTTAATGAGAGCCTCGCACAGTGATTTCAGCGTGACAGCTCGGGATGGAAAAA 
CTTACGATATCCCAAAGGGTCACATCGTTGCAACCTCCCCTGCATTTGCCAACCG 
CTTACCGCACATCTTCAAAGACCCCGACACCTACGACCCAGAAAGATTCTCCCCT 
50 GGAAGAGAAGAGGACAAAGCCGCAGGGGCATTCTCGTACATTGCATTCGGAGGGG 
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GAAGGCACGGGTGCCTTGGAGAGCCGTTTGCTTACCTGCAGATCAAAGCCATATG 
GAGTCATTTGTTGAGGAACTTCGAGCTTGAGCTAGTTTCACCGTTCCCTGAGATT 
GACTGGAACGCTATGGTGGTTGGAGTTAAAGGCAATGTGATGGTGCGTTACAAGA 
GGCgcCAGCTTTCTTAAAGACAAGTTTAAGGTTATTGCAGCTTTGGATTTTTCTC 
5 TCTGGTTTCTGCTTTGCTTTTGTCCCTCTCTGGTTTTAGTTTTGTTGTTGAATAA 
TTCTTCTGTTTTTATAAACTGTTGTTACTCTTTAATTGACATTTATTTTTAAGCT 
TCCTAAGTTTGTGGTTCAAAAAAAAAAAAAGGCGGCGTTACT 

SEQ ID 19 = Arahidopsis obtusifoliol C14a- 
10 demethylase (clone ID: ATA403931) amino acid translation 

MELDSENKLLKTGLVIVATLVIAKLIFSFFTSDSKKKRLPPTLKAWPPLVGSLIK 
FLKGPIIMLREEYPKLGSVFTVNLVHKKITFLIGPEVSAHFFKASESDLSQQEVY 
QFNVPTFGPGWFDVDYSWQEQFRFFTEALRWKIjKGYVDMMVTEAEDYFSKW^ 
1 5 E SGE VD I KVELERL I I LTASRCLLGREVRDQLFDDVS ALFHDLDNGMLP I S VLF P 

ylpipahrrrdrareklseifakiigsrkrsgktendmlqcfieskykdgrqtte 
sevtglliaalfagqhtssitstwtgaylmrykeyfsaaldeqknliakhgdkid 
hdilsemdvlyrcikealrlhpplimlmrashsdfsvtardgktydipkghivat 
spafanrlphifkdpdtydperfspgreedkaagafsyiafgggrhgclgepfay 
20 lq i kai wshllrnfelelvs pfpe i dwnamwgvkgnvmvrykrrqls * 



fy SEQ ID 20 = Arahidopsis sterol methyl transferase 

y I protein sequence (Accession NO: U71400) See Appendix 

s D 
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SEQ ID 21 = Tobacco sterol methyl transferase I 
protein sequence (from Prof. Pierre Benveniste 
Accession NO: U81312) See Appendix E 

30 SEQ ID 22 = Arahidopsis sterol methyl transferase 

II protein sequence (Accession NO: X89867) (from Prof. 
Pierre Benveniste) See Appendix F 

SEQ ID 23 = Arahidopsis sterol C5-desaturase 
35 protein sequence (Accession NO: X90454) See Appendix G 

SEQ ID 24 = Rubber truncated HMGRlml {3566 to A) 
nucleotide sequence 

ATGGCACGCGCCTCCCATGACGTGTGGGACCTCGAAGATACGGATCCCAACTACC 
40 TCATCGATGAAGATCACCGTCTCGTTACTTGCCCTCCCGCTAATATATCTACTAA 
GACTACCATTATTGCCGCACCTACCAAATTGCCTACCTCGGAACCCTTAATTGCA 
CCCTTAGTCTCGGAGGAAGACGAAATGATCGTCAACTCCGTCGTGGATGGGAAGA 
TACCCTCCTATTCTCTGGAGTCGAAGCTCGGGGACTGCAAACGAGCGGCTGCGAT 
TCGACGCGAGGCTTTGCAGAGGATGACAAGGAGGTCGCTGGAAGGCTTGCCAGTA 
45 GAAGGGTTCGATTACGAGTCGATTTTAGGACAATGCTGTGAAATGCCAGTGGGAT 
ACGTGCAGATTCCGGTGGGGATTGCGGGGCCGTTGTTGCTGAACGGGCGGGAGTA 
CTCTGTTCCAATGGCGACCACGGAGGGTTGTTTGGTGGCGAGCACTAATAGAGGG 
TGTAAGGCGATTTACTTGTCAGGTGGGGCCACCAGCGTCTTGTTGAAGGATGGCA 
TGACAAGAGCGCCTGTTGTAAGATTCGCGTCGGCGACTAGAGCCGCGGAGTTGAA 
50 GTTCTTCTTGGAGGATCCTGACAATTTTGATACCTTGGCCGTAGTTTTTAACAAG 
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TCCAGTAGATTTGCGAGGCTCCAAGGCATTAAATGCTCAATTGCTGGTAAGAATC 
TTTATATAAGATTCAGCTGCAGCACTGGCGATGCAATGGGGATGAACATGGTTTC 
TAAAGGGGTTCAAAACGTTCTTGAATTTCTTCAAAGTGATTTTTCTGATATGGAT 
GTCATTGGAATCTCAGGAAATTTTTGTTCGGATAAGT^GCCTGCTGCTGTAAATT 
5 GGATTGAAGGACGTGGCAAATCAGTTGTTTGTGAGGCAATTATCAAGGAAGAGGT 
GGTGAAGAAGGTGTTGAAAACCAATGTGGCCTCCCTAGTGGAGCTTAACATGCTC 
AAGAATCTTGCTGGTTCTGCTGTTGCTGGTGCTTTGGGTGGATTTAATGCCCATG 
CAGGCAACATCGTATCTGCAATCTTTATTGCCACTGGCCAGGATCCAGCACAGAA 
TGTTGAGAGTTCTCATTGCATTACCATGATGGAAGCTGTCAATGATGGAAAGGAT 
10 CTCCATATCTCTGTGACCATGCCCTCCATTGAGGTGGGTACAGTCGGAGGTGGAA 
CTCAACTTGCATCTCAGTCTGCTTGTCTCAATTTGCTTGGGGTGAAGGGTGCAAA 
CAAAGAGTCGCCAGGATCAAACTCAAGGCTCCTTGCTGCCATCGTAGCTGGTTCA 
GTTTTGGCTGGTGAGCTCTCCTTGATGTCTGCCATTGCAGCTGGGCAGCTTGTCA 
AGAGTCACATGAAGTACAACAGAGCCAGCAAAGATATGTCTAAAGCTGCATCTTA 
15 G 

SEQ ID 25 = Rubber truncated HMGRlml (S566 to A) 
amino acid translation 
^ MARASHDVWDLEDTDPNYLIDEDHRLVTCPPANISTKTTIIAAPTKLPTSEPLIA 
[4 20 PLVSEEDEMIVNSVVDGKIPSYSLESKLGDCKRAAAIRREALQRMTRRSLEGLPV 

tj EGFDYESILGQCCEMPVGYVQIPVGIAGPLLLNGREYSVPMATTEGCLVASTNRG 
m CKAIYLSGGATSVLLKDGMTRAPWRFASATRAAELKFFLEDPDNFDTLAWFNK 

S S R F ARLQG I KG S I AGKNL Y I RF S C S TGD AMGMNMVS KG VQNVLE F LQ S D F S DMD 
VIGI SGNFCSDKXPAAVNWIEGRGKS WCEAI IKEEWKKVLKTNVASLVELNML 
25 KNLAGSAVAGALGGFNAHAGNIVSAI FIATGQDPAQNVESSHCITMMEAVNDGKD 
LHISVTMPSIEVGTVGGGTQLASQSACLNLLGVKGANKESPGSNSRLLiAAIVAGS 
VliAGELSLMSAIAAGQLVKSHMKYNRASKDMSKAAS 



t f 



^" 30 



SEQ ID. 26 = Rubber truncated HMGRlm2 (S567 to A) 
nucleotide sequence 

ATGGCACGCGCCTCCCATGACGTGTGGGACCTCGAAGATACGGATCCCAACTACC 
TCATCGATGAAGATCACCGTCTCGTTACTTGCCCTCCCGCTAATATATCTACTAA 

35 GACTACCATTATTGCCGCACCTACCAAATTGCCTACCTCGGAACCCTTAATTGCA 
CCCTTAGTCTCGGAGGAAGACGAAATGATCGTCAACTCCGTCGTGGATGGGAAGA 
TACCCTCCTATTCTCTGGAGTCGAAGCTCGGGGACTGCAAACGAGCGGCTGCGAT 
TCGACGCGAGGCTTTGCAGAGGATGACAAGGAGGTCGCTGGAAGGCTTGCCAGTA 
GAAGGGTTCGATTACGAGTCGATTTTAGGACAATGCTGTGAAATGCCAGTGGGAT 

40 ACGTGCAGATTCCGGTGGGGATTGCGGGGCCGTTGTTGCTGAACGGGCGGGAGTA 
CTCTGTTCCAATGGCGACCACGGAGGGTTGTTTGGTGGCGAGCACTAATAGAGGG 
TGTAAGGCGATTTACTTGTCAGGTGGGGCCACCAGCGTCTTGTTGAAGGATGGCA 
TGACAAGAGCGCCTGTTGTAAGATTCGCGTCGGCGACTAGAGCCGCGGAGTTGAA 
GTTCTTCTTGGAGGATCCTGACAATTTTGATACCTTGGCCGTAGTTTTTAACAAG 

45 TCCAGTAGATTTGCGAGGCTCCAAGGCATTAAATGCTCAATTGCTGGTAAGAATC 
TTTATATAAGATTCAGCTGCAGCACTGGCGATGCAATGGGGATGAACATGGTTTC 
TAAAGGGGTTCAAAACGTTCTTGAATTTCTTCAAAGTGATTTTTCTGATATGGAT 
GTCATTGGAATCTCAGGAAATTTTTGTTCGGATAAGAAGCCTGCTGCTGTAAATT 
GGATTGAAGGACGTGGCAAATCAGTTGTTTGTGAGGCAATTATCAAGGAAGAGGT 

50 GGTGAAGAAGGTGTTGAAAACCAATGTGGCCTCCCTAGTGGAGCTTAACATGCTC 
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AAGAATCTTGCTGGTTCTGCTGTTGCTGGTGCTTTGGGTGGATTTAATGCCCATG 
CAGGCAACATCGTATCTGCAATCTTTATTGCCACTGGCCAGGATCCAGCACAGAA 
TGTTGAGAGTTCTCATTGCATTACCATGATGGAAGCTGTCAATGATGGAAAGGAT 
CTCCATATCTCTGTGACCATGCCCTCCATTGAGGTGGGTACAGTCGGAGGTGGAA 
CTCAACTTGCATCTCAGTCTGCTTGTCTCAATTTGCTTGGGGTGAAGGGTGCAAA 
CAAAGAGTCGCCAGGATCAAACTCAAGGCTCCTTGCTGCCATCGTAGCTGGTTCA 
GTTTTGGCTGGTGAGCTCTCCTTGATGTCTGCCATTGCAGCTGGGCAGCTTGTCA 
AGAGTCACATGAAGTACAACAGATCCGCCAAAGATATGTCTAAAGCTGCATCTTA 
G 



SEQ ID 27 = Rubber truncated HMGRlm2 (S567 to A) 
amino acid translation 

MARASHDVWDLEDTDPNYLIDEDHRLVTCPPANISTKTTIIAAPTKLPTSEPLIA 
15 PLVSEEDEMIVNSVVDGKIPSYSLESKLGDCKRAAAIRREALQRMTRRSLEGLPV 
EGFDYESILGQCCEMPVGYVQIPVGIAGPLLLNGREYSVPMATTEGCLVASTNRG 
CKAIYLSGGATSVLLKDGMTRAPWRFASATRAAELKFFLEDPDNFDTLAWFNK 
SSRFARLQGIKCSIAGKNLYIRFSCSTGDAMGMNMVSKGVQNVLEFLQSDFSDMD 
VIGISGNFCSDKKPAAVNWIEGRGKSWCEAIIKEEWKKVLKTNVASLVELNML 
20 KNLAGSAVAGALGGFNAHAGNIVSAI FI ATGQDPAQNVESSHCITMMEAVNDGKD 
LH I S VTMP S I E VGTVGGGTQLAS Q S ACLNLLGVKGANKE S PGSNS RLLAAI VAGS 
VliAGELSLMSAIAAGQLVKSHMKYNRSAKDMSKAAS 

Example 6 . Ar^hidopsis obtusifoliol C14a- 
25 demethylase constructs 

The Arahidopsis obtusifoliol C14a-demethylase gene 
was amplified from two separate Arahidopsis mRNA 
samples (SIN 2 and Keto-10) through use of primers 
BXK33 and BXK34, as described below. 

30 

BXK33 (SEQ ID 28): 5 ' -GAGATCTGAACCCTAACGAGAG- 3 ' 
BXK34 (SEQ ID 29): 5 ' - GGAGCTCTTAAGAAAAGGGACGACGC - 3 ' 

35 The primer BXK33 has a Bgl II cleavage site shown 

in bold. The primer BXK34 has a Sac I cleavage site 
shown in bold. The actual size of the structural gene 
is 1.445 Kb. 

The Arahidopsis mRNA was amplified using a Perkin 
40 Elmer GeneAmp RT-PCR kit. The reverse transcription 

reaction used 25 mM MgClj (4 |il ; 5 mM final), lOX PGR 
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buffer (2 |al) , di DEPC water (1 \xl) , 2 each of 1 mM 
solution of each of four dNTPs (dGTP, dATP, dUTP, 
dCTP) , RNase inhibitor (1 (4.1 of 10 units per |al stock) , 
MMLV reverse transcriptase (1 )il of a 2.5 U/|il stock), 

Oligo d(T)16 Primer (1 pil of a 2 . 5 jiM stock), and 2 [0.1 
of an Arabidopsis polyA RNA sample. The reaction mix 
was incubated at room temperature (about 2 0°C) for 10 
minutes, then in a PGR machine for one cycle (15 min. 
at 42°C, 5 min. at SS^'C and 5 min, at 4°C) . 

Separate primer-mediated amplification reactions 
were carried out using Taq DNA polymerase and Vent DNA 

polymerase to obtain Arahidopsis obtusifoliol C14a- 
demethylase cDNA from the amplified mRNA sample. 



Taq PGR Reaction 


Vent PGR Reaction 


4 jil 25 mM MgGls 


4 111 2 5 mM MgGls 


8 |al lOX PGR buffer II 


8 10.1 lOX Vent PGR buffer 


65.5 |al di DEPG water 


65,. 5 [xl di DEPG water 


0.5 i-il AmpliTaq polymerase 


0.5 \xl Vent polymerase 



After 1 minute and 3 5 seconds at 95°G, 1 |il each of 

15 i^M stocks of the upstream and downstream primers 
(BXK3 3 and BXK34) were added to the PGR reaction (100 
111 total PGR reaction volume) and the PGR reaction 
solutions were subjected to 35 cycles (95°G for 15 
seconds, then 60°G for 30 seconds) . The amplified PGR 
reaction was then maintained at 72°G for 7 minutes and 
then stored at 4°G . An amplification positive control 
reaction was carried out under the same conditions with 
DM151 and DM152 primers. 



DM151 (SEQ ID 30): 5 ' -GTGTGTGAATGAGAAATGGTTGTATG- 3 ' 
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DM152 (SEQ ID 31): 5 ' - CATGTCAAATTTCACTGCTTCATCC - 3 ' 

Electrophoresis of the nucleic acid solutions 
after PGR amplification displayed an amplification 
product corresponding approximately to the size of the 
desired 1.445 Kb structural gene. The fragment was 
cloned into an M13 vector. A representative sequencing 
reaction consisted of: 10 |al of plasmid DNA (200-500 
ng) , 2 |j.l of M13 Forward or Reverse primer (15 
picomoles) and 8 \xl of Big Dye Terminator Reaction Mix 
(PE Applied Biosci . ) . The clone copy of ATA101105 was 
called CPR17398. The sequence of the selected clone 
(Arabidopsis obtusifoliol C14a-demethylase) is 
identified as SEQ ID NO: 9. 

The predicted polypeptide sequence for the cloned 

Arabidopsis obtusifoliol C14a-demethylase sequence was 
subjected to a BLAST search in the public database and 
found to align with the sorghum obtusifoliol 14 -alpha 
demethylase polypeptide (ATA101105/U743 19/gl658192 ; and 
gl216657/U74319) exhibiting 75-78% sequence identity 
and 87-90% sequence homology. The cloned nucleic acid 
encoding Arabidopsis obtusifoliol C14a-demethylase (SEQ 
ID No: 9) is missing the 5' end. 

The 5' terminal portion of the structural gene was 
obtained by the RACE (Rapid Amplification of cDNA Ends) 
PGR using primers BXK39 and BXK40 per manufacturer's 
instructions (Clontech) . 

BXK3 9 (SEQ ID 32): 5 ' -GAGATCTGGAGAGATTTAAAGAAGGGTAAGG- 3 ' 
BXK4 0 (SEQ ID 33): 5'- GGAGCTCGGTTTTTAAGAAAAGGGAGGAGGC - 3 ' 

The cloned nucleic acid encoding full length 
Arabidopsis obtusifoliol C14a-demethylase is identified 
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as SEQ ID No: 8. The amplified Arahidopsis obtusifoliol 
C14a-demethylase structural gene is useful for making 
constructs that express Arahidopsis obtusifoliol C14a- 
demethylase in transgenic plants. 

In light of the detailed description of the 
invention and the examples presented above, it can be 
appreciated that the several aspects of the invention 
are achieved. 

It is to be understood that the present invention 
has been described in detail by way of illustration and 
example in order to acquaint others skilled in the art 
with the invention, its principles, and its practical 
application. Particular formulations and processes of 
the present invention are not limited to the 
descriptions of the specific embodiments presented, but 
rather the descriptions and examples should be viewed 
in terms of the claims that follow and their 
equivalents; While some of the examples and 
descriptions above include some conclusions about the 
way the invention may function, the inventor does not 
intend to be bound by those conclusions and functions, 
but puts them forth only as possible explanations. 

It is to be further understood that the specific 
embodiments of the present invention as set forth are 
not intended as being exhaustive or limiting of the 
invention, and that many alternatives, modifications, 
and variations will be apparent to those of ordinary 
skill in the art in light of the foregoing examples and 
detailed description. Accordingly, this invention is 
intended to embrace all such alternatives, 
modifications, and variations that fall within the 
spirit and scope of the following claims. 



